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7A.1  Executive  Summary 

In  the  past  year  we  have  made  progress  in  several  areas  relating  to  problem 
solving  in  temporally  complex  domains.  The  principle  results  are  the  following: 

-  an  extension  of  the  interval-based  theory  of  time  (developed  in  previous  years 
under  RADC  funding)  to  allow  two  different  forms  of  time-points; 

-  a  generalized  model  of  plan  recognition,  both  as  a  formal  theory  and  as  a 
family  of  practical  recognition  algorithms  (Henry  Kautz,  Ph.D.,  1987); 

-  the  formulation  of  two  distinct  forms  of  abstraction  useful  for  planning  and 
the  investigation  of  their  properties  (Tenenburg,  1987); 

-  an  extension  to  the  STRIPS  representation  of  action  that  allows  indirect 
effects  and  effects  that  are  dependent  on  context  (Weber,  to  appear); 

-  the  development  of  an  initial  version  of  the  RHET  representation  system, 
extending  the  HORNE  system  (see  Miller  &  Allen,  1987). 

Each  of  these  projects  is  described  in  more  detail  below.  Another  important  event 
this  year  is  that  tne  HORNE  system,  completed  in  1986,  has  been  distributed  to  over 
50  different  research  sites  in  the  last  year,  including  places  such  as  Advanced 
Decision  Systems,  SRI  International  and  MCC. 

In  the  project  on  formal  temporal  models,  we  axiomatized  a  theory  of  time  in 
terms  of  intervals  and  the  single  relation  MEET  and  showed  that  this  theory 
subsumes  Allen's  interval- based  theory.  We  then  extended  the  theory  by  formally 
defining  the  beginnings  and  endings  of  intervals  and  showed  that  these  have 
properties  we  normally  would  associate  with  points.  We  distinguished  between 
these  point-like  objects  and  the  concept  of  moment  as  hypothesized  in  discrete  time 
models.  Finally,  we  examined  the  theory  in  terms  of  each  of  several  different 
models. 

Henry  Kautz  completed  his  Ph.D.  thesis  which  involved  the  first  formal 
description  of  the  plan  recognition  process  and  completed  an  initial  implementation 
of  a  plan  recognizer  that  uses  action  abstraction  to  reduce  the  combinatorial 
explosion  found  in  simple  plan  recognition  algorithms.  The  result  is  a  family  of  plan 
recognition  algorithms,  each  of  increasing  complexity  and  cost.  Most  notable  is  this 
system’s  ability  to  handle  observed  actions  that  are  actually  steps  in  two  plans  being 
executed  simultaneously,  actions  that  are  only  vaguely  specified,  and  the  ability  to 
accept  observed  actions  in  any  temporal  ordering. 

We  have  completed  the  first  phase  of  the  implementation  of  the  RHET  system,  a 
knowledge  representation  language  based  on  our  previous  representation  system, 
HORNE.  This  work  included  design  of  the  basic  system  including  all  the  features  of 
HORNE  extended  with  a  version  of  conniver-like  contexts,  full  equality  reasoning  in 
contexts,  a  TMS  and  an  extended  type  reasoning  system.  This  system  will  be  tested 
in  some  realistic  application  in  natural  language  and  planning  applications  in  early 
’88. 

We  also  explored  different  uses  of  action  abstraction  in  planning  systems.  Josh 
Tenenburg’s  Ph.D.  thesis,  to  be  completed  in  early  1988,  defines  two  forms  of 
abstraction:  one  found  by  simply  relaxing  the  conditions  under  which  actions  may 
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apply,  and  the  second  by  using  a  predicate  mapping  framework  to  define  analogies 
between  actions.  With  the  first,  he  developed  a  theory  in  which  it  can  be  shown  that 
if  a  fully  specific  plan  exists  for  some  problem,  then  there  exists  an  abstract  solution 
to  an  abstraction  of  the  problem.  While  this  might  seem  an  obvious  property,  many 
previous  models  of  abstraction  do  not  guarantee  this.  With  the  second  form,  based  on 
analogy,  the  opposite  property  can  be  proven:  an  abstract  solution  to  a  problem 
guarantees  the  existence  of  a  fully  concrete  solution. 

The  final  area  concerns  the  representation  and  use  of  actions  when  reasoning 
about  in  a  complex  world.  Our  approach,  based  on  the  notion  of  action  generation, 
has  important  improvements  over  the  pervasive  STRIPS  representation  of  action  as 
well  as  recent  more  formal  theories,  including  the  treatment  of  indirect  effects,  low- 
level  actions,  and  action  co-occurrence.  The  preliminary  results  are  encouraging  and 
this  project  will  be  completed  as  part  of  Jay  Weber’s  Ph.D.  dissertation  (expected 
early  1989).  Future  work  will  include  a  more  general  representation  of  action  and 
time  that  allows  actions  to  have  durations,  thereby  allowing  serial  action 
composition  and  requiring  the  generalization  of  preconditions  to  include  properties 
that  must  hold  during  the  action. 


7A.2  Introduction 

The  unifying  theme  underlying  all  of  the  sub  projects  is  that  of  producing  new 
knowledge  representation  formalisms  that  extend  the  range  of  situations  in  which 
problem  solving  systems  can  be  applied.  The  primary  extension  examined  so  far  has 
been  the  use  of  temporal  knowledge.  By  introducing  time  explicitly  into  the 
representation,  a  wide  range  of  new  situations  can  be  represented  including  ones 
where  more  than  one  action  may  occur  at  a  time  and  where  external  events  may 
occur  beyond  the  control  of  the  planner. 

In  addition,  using  time  explicitly  in  the  representation  allows  for  a  variety  of 
formal  models  of  action  that  allow  the  first  precisely  defined  model  of  the  plan 
recognition  process,  as  well  as  a  formal  definition  of  what  it  means  for  one  action  to 
be  an  abstraction  of  another.  With  these  advances,  confusions  made  in  earlier 
planning  systems  can  be  identified  and  corrected,  and  more  general  systems  can  be 
developed. 

In  the  rest  of  this  report  we  examine  particular  results  made  this  year  in  the 
formal  representation  of  time,  including  time  "intervals”  and  time  "points”,  the  first 
formalization  of  the  plan  recognition  process,  the  definition  of  two  distinct  types  of 
abstraction  and  an  investigation  of  their  properties  and  a  new  representation  of 
action  that  allows  context-dependent  effects.  In  addition,  we  have  completed  the 
first  stage  of  the  RHET  system,  a  practical  knowledge  representation  system  that 
not  only  has  a  formally  well  denned  semantics,  but  also  has  a  well-defined 
inferential  capability. 
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7A.3  Time-Oriented  Problem  Solving 


7A.3.1  Time  Points  and  Moments 

Interval-based  temporal  reasoning,  since  its  introduction  by  the  PI  five  years  ago, 
has  proven  to  be  a  useful  formalism  in  many  different  areas.  The  purpose  of  this 
current  project  was  to  provide  a  concise  formal  theory  for  the  interval  logic  and  to 
investigate  the  relationship  between  interval  and  point  based  temporal 
representations.  In  conjunction  with  Pat  Hayes,  now  at  Xerox  PARC,  a  new  theory 
has  been  developed  with  the  following  features: 

•  The  introduction  of  time  periods  as  the  basic  building  block,  which  does  not 
commit  to  either  being  an  interval  or  a  (certain  form  of)  point. 

•  A  concise  axiomatization  (five  axioms)  that  completely  capture  the  inferential 
capability  of  interval- logic. 

•  The  development  of  two  distinct  notions  of  point-like  time  periods 

-  Points,  which  are  endpoints  of  periods  and  can  be  shown  not  to  be 
periods  themselves  but  are  useful  for  representing  times  of  transition 

-  Moments,  which  are  non-decomposable  periods,  and  are  useful  for 
representing  "instantaneous”  events  such  as  flashes,  clicks,  etc. 

•  Demonstrating  that  our  logic  is  consistent  with  various  models  of  discrete 
time,  continuous  time,  or  models  that  mix  the  two. 

Two  of  the  papers  in  the  appendix  present  this  work  in  detail. 


7A.3.2  A  Formal  Theory  of  Plan  Recognition 

Research  in  discourse  analysis,  story  understanding,  and  user  modeling  for 
expert  systems  has  shown  great  interest  in  plan  recognition  problems.  In  a  plan 
recognition  problem,  one  is  given  a  fragmented  description  of  actions  performed  by 
one  or  more  agents,  and  expected  to  infer  the  overall  plan  or  scenario  which  explains 
those  actions.  This  project  developed  the  first  formal  description  of  the  plan 
recognition  process. 

Beginning  with  a  reified  logic  of  events,  we  designed  a  scheme  for  hierarchically 
structuring  a  library  of  event  types.  A  semantic  basis  for  non-deductive  inference, 
called  "minimum  covering  entailment”,  justifies  the  conclusions  that  one  may  draw 
from  a  set  of  observed  actions.  Minimum  covering  entailment  is  defined  by 
delineating  the  class  of  models  in  which  the  library  is  complete  and  the  set  of 
unrelated  observations  is  minimized.  An  equivalent  proof  theory  forms  a 
preliminary  basis  for  mechanizing  the  theory.  Equivalence  theorems  between  the 
proof  and  model  theories  are  presented.  Minimum  covering  entailment  is  related  to 
a  formalism  for  non-monotomc  inference  known  as  "circumscription”.  Finally,  the 
thesis  describes  a  number  of  algorithms  which  correctly  implement  the  theory, 
together  with  a  discussion  of  their  complexity. 


7A-4 


The  theory  has  been  applied  to  a  number  of  examples  of  plan  recognition,  in 
domains  ranging  from  an  operating  system  advisor  to  the  theory  of  speech  acts.  The 
thesis  shows  how  problems  of  medical  diagnosis,  a  similar  kind  of  non-deductive 
reasoning,  can  be  cast  in  the  framework,  and  an  example  previously  solved  by  a 
medical  expert  system  is  worked  out  in  detail.  It  has  also  been  ported  to  BBN  Labs 
for  use  in  their  FRESH/OSGP  system  (Naval  Search  and  Rescue  domain). 

The  analyses  provides  a  firm  theoretical  foundation  for  much  of  what  is  loosely 
called  "frame  based  inference”,  and  directly  accounts  for  problems  of  ambiguity, 
abstraction,  and  complex  temporal  interactions,  which  were  ignored  by  previous 
work.  The  framework  can  be  extended  to  handle  difficult  phenomena  such  as  errors, 
and  can  also  be  restricted  in  order  to  improve  its  computational  properties  in 
specialized  domains. 


7A.3.3  Abstraction  in  Planning 

The  research  conducted  this  year  focused  on  issues  involving  abstraction  in 
planning.  In  order  for  automated  agents  to  be  of  use,  there  is  a  need  for  them  to 
model,  reason,  and  understand  the  world  at  a  level  sufficient  to  satisfy  abstract  goals 
within  flexible,  dynamic  environments.  It  is  clear  to  most  researchers  in  artificial 
intelligence  that  this  will  require  the  use  of  expressive  syntactic  languages,  e.g., 
first-order  logic,  in  order  to  fully  meet  these  knowledge  requirements.  As  a 
consequence,  however,  the  computational  tasks  associated  with  manipulating  these 
languages  is  overwhelming.  As  such,  there  is  an  increasing  awareness  that  agents 
will  require  the  use  of  simplified,  abstract  models  of  the  world  to  solve  a  great  deal  of 
their  problems  quickly,  and  that  much  of  their  expertise  will  be  reflected  by  their 
choices  of  which  simplified  models  to  utilize.  It  is  not  always  practical  for  human 
system  designers  to  pre-specify  which  simplifications  will  maximize  the  agent’s 
effectiveness.  Therefore,  it  is  important  to  define  in  a  formal  sense  the  exact  nature 
of  the  abstraction  process,  so  that  eventually  automated  agents  can  perform  their 
own  abstractions.  It  is  toward  this  end  that  this  year’s  work  was  directed. 

Abstraction  typically  involves  removing  details  from  a  model  in  such  a  way  that 
the  remaining  representation  continues  to  have  internal  coherence,  as  well  as  a  clear 
relationship  to  the  domain  that  is  being  modeled.  My  work  centered  around  defining 
two  different  types  of  abstraction  within  a  symbolic  planning  representation, 
STRIPS.  STRIPS  is  a  representation  that  views  the  world  as  being  in  a  particular 
situation  at  each  moment.  The  actions  of  the  agent  are  modeled  by  operators  which 
can  be  applied  to  a  situation  satisfying  its  preconditions  in  order  to  have  the  desired 
effects. 

Initially,  work  was  performed  in  order  to  provide  STRIPS  with  a  model-theoretic 
semantics.  Although  this  representation  has  been  extensively  used  for  over 
decades,  it  lacked  a  clear  statement  of  either  its  syntax  or  semantics.  A  first-order 
semantics  was  provided  by  specifying  a  set  of  rules  that  translate  a  STRIPS  system 
into  an  equivalent  set  of  sentences  in  first-order  predicate  calculus.  The  equivalence 
is  demonstrated  by  proving  that  the  translation  has  a  semantic  model  in  exactly 
those  cases  in  which  the  STRIPS  system  will  never  generate  an  inconsistent 
situation  through  the  application  of  sequences  of  operators  whose  preconditions  are 
satisfied. 

The  first  type  of  abstraction  involves  simplifying  the  model  by  relaxing  the 
conditions  under  which  the  operators  can  be  applied.  This  work  is  an  extension  of 
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work  by  Sacerdoti  at  SRI  in  the  early  1970’s.  Problems  associated  with  this  are  that 
it  is  difficult  to  ensure  that  the  world  model  still  has  coherence.  In  fact,  the  naive 
approach  suffers  from  just  these  problems.  My  solution  was  to  consider  the  world  as 
being  composed  of  causally  partitioned  subtheories,  each  of  which  bears  on  the 
operators.  In  relaxing  constraints  on  the  preconditions,  one  does  so  only  along  these 
partition  boundaries.  For  instance,  one  can  partition  the  set  of  sentences  relating  to 
the  robot  from  those  relating  to  the  objects  that  it  manipulates,  and  in  this  way 
suppress  the  position  of  the  robot  in  searching  for  an  abstract  solution  to  object- 
moving  problem.  In  this  way,  one  is  able  to  abstract  in  much  the  same  way  as  in 
Sacerdoti’s  system,  and  in  addition,  one  is  guaranteed  that  the  models  one  generates 
at  abstract  planning  levels  are  never  inconsistent.  In  addition,  it  is  shown  that  these 
abstract  systems  have  the  upward-solution  property,  which  states  that  if  there  exists 
a  solution  to  the  problem  in  the  original  problem  space,  there  exists  an  abstract 
solution  to  an  abstraction  of  the  problem. 

The  second  type  of  abstraction  involves  abstraction  by  analogy.  In  particular,  the 
predicates  used  to  describe  objects  and  relationships  can  be  mapped  into  abstract 
predicates  to  reflect  similarity.  So,  for  instance,  bottles  and  boxes  can  be  mapped  to 
containers,  with  the  axioms  that  appear  in  the  abstract  theory  being  only  those  that 
are  common  to  the  original  predicates.  Thus,  one  cannot  conclude  that  containers 
are  made  of  glass,  since  this  is  only  true  of  bottles,  although  it  can  be  concluded  that 
all  containers  hold  things.  These  analogical  mappings  are  extended  to  operators,  so 
that  those  operators  performing  similar  actions  on  boxes  and  bottles  (getting  stuff  in, 
removing  stuff)  can  he  considered  an  abstract  action  on  containers.  Planning  can 
then  proceed  at  the  abstract  level  by  attempting  to  first  solve  the  problem  by  the 
abstract  operators  on  the  abstract  world  description.  If  a  plan  is  found,  each  abstract 
operator  is  specialized  by  one  of  the  operators  that  maps  to  it.  As  in  the  previous  type 
of  abstraction,  particular  care  is  taken  in  ensuring  that  consistency  is  preserved  in 
the  abstract  spaces.  In  addition,  it  is  shows  that  this  type  of  abstraction  has  the 
downward-solution  property,  which  states  that  if  there  exists  an  abstract  solution  to 
an  abstraction  of  the  problem,  then  there  exists  a  solution  to  some  specialization  of 
the  problem.  There  is  also  an  intuitively  appealing  model-theoretic  interpretation  of 
the  abstraction.  For  every  semantic  model  M  of  the  original  problem  space,  there 
exists  a  model  M ’  of  the  abstract  space  where  the  extension  of  each  abstract  predicate 
in  Af’  is  the  union  of  the  extensions  of  the  original  predicates  in  M.  So,  for  instance, 
for  any  model  of  a  theory  containing  bottles  and  boxes  (and  perhaps  other 
containers),  there  exists  a  model  of  the  abstract  theory,  where  those  objects  which 
are  containers  are  exactly  those  objects  which  are  either  bottles  or  boxes  or  in  the 
extension  of  one  of  the  other  predicates  from  the  original  theory  that  maps  to 
container. 


Expections  in  1988 

The  planned  research  is  to  extend  some  of  the  issues  raised  in  my  dissertation  on 
Abstraction  in  Planning.  In  particular,  there  is  little  formal  analysis  that  has  been 
performed  that  quantifies  the  performance  gains  that  can  be  made  in  using 
abstraction  spaces  in  search.  The  work  that  has  been  performed  is  either  of  a 
heuristic  nature  or  is  within  weaker,  state-space  representations.  This  also  points 
out  the  need  to  specify  those  strategies  that  exploit  the  presence  of  abstractions. 

One  of  the  focuses  of  my  dissertation  was  in  maping  first-order  theories  of  a 
domain  to  simpler  first-order  theories.  Another  approach  that  I  will  be  exploring  is 
to  map  into  a  different  representation  language  in  order  to  exploit  the  better 
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performance  associated  with  solving  problems  associated  with  the  class  of  problems 
expressible  in  this  language.  For  instance,  suppose  that  one  has  a  first-order 
representation  of  a  robot’s  environment.  It  is  to  the  agent’s  detriment  to  use  a 
theorem-prover  to  perform  path  planning  through  a  set  of  rooms  connected  by 
doorways  if  the  agent  has  available  to  it  a  graph  structure  representing  the 
connectivity  of  the  rooms,  since  graph  searches  are  almost  trivially  easy.  The  task, 
then,  is  to  specify  how  an  agent  can  recognize  the  presence  of  subparts  of  its  domain 
model  represented  in  an  expressive  language,  such  as  first-order  logic,  that  will  map 
into  a  simple  representation,  such  as  graphs,  upon  which  very  fast  algorithms  are 
known  to  exist. 


7A.3.3  Categories  and  Planning 

Extensive  psychological  work  has  been  done  on  the  content  and  function  of 
categories  in  our  perception  of  the  world.  Recently,  the  emphasis  has  shifted  from 
looking  for  categories  implicit  in  the  objects  themselves  to  categories  formed  by  the 
interaction  between  the  agent  and  the  objects.  This  has  been  loosely  discussed  as 
"goal-based”  or  "context-dependent”  categorization.  This  project  was  based  on  the 
observation  that  an  agent’s  appreciation  of  the  relevance  of  object  properties  to  its 
goals  is  what  planning  is  about.  We  showed  how  functional  categories  in  the  form  of 
lambda  expressions  can  be  extracted  from  the  preconditions  of  valid  plans  to  achieve 
the  same  goals.  For  example,  although  pens  and  typewriters  have  remote  physical 
similarities,  they  both  take  part  in  plans  that  create  a  written  note.  More 
importantly,  either  may  fulfill  the  same  role  of  WRITING-INSTRUMENT  in  an 
abstract  plan  that  creates  a  written  note.  This  framework  allows  an  agent  to  ignore 
superfluous  similarities  and  focus  on  often  subtle  function  similarities. 

The  notion  of  abstraction  is  important  when  considering  how  to  limit  explosive 
search.  In  an  effort  to  understand  more  about  the  relationship  between  more  or  less 
abstract  theories,  we  developed  a  model-theoretic  view  of  an  abstraction  relation 
between  first-order  logical  theories  in  terms  of  their  model  sets,  or  equivalently  their 
theorem  sets.  We  then  looked  at  simple  axiom  deletion  and  disjunction  introduction 
syntactic  operations  on  theories  and  showed  that  they  (non-strictly)  weaken  them, 
creating  consistent  abstracted  theories.  I  then  proved  two  key  theorems 
demonstrating  the  power  of  these  syntactic  operations,  including  what  assumptions 
must  be  made  to  generate  any  abstraction  of  a  given  theory. 

The  traditional  STRIPS  Assumption  stipulates  that  action  representations  are 
self-contained.  In  particular,  this  disallows  context-dependent  effects  and  therefore 
does  not  address  the  Ramification  Problem.  This  limitation  also  appears  in  some 
recent  formal  theories  of  action.  We  formulated  a  new  formal  theory  that  addresses 
the  competence  aspect  of  the  Ramification  Problem  through  a  different 
representation  of  action  based  on  basic  actions  that  do  not  have  a  fixed  context.  With 
appropriate  definitions  and  a  single  axiom,  important  laws  of  change  and  inertia 
become  true  that  allow  the  determination  of  resulting  states.  In  addition,  the 
temporal  model  allows  co-occurrence  and  non-occurrence  of  actions. 
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7A.4  Rhetorical 


T**  e  Project 

Rhetorical  (Rhet)  is  a  programming/knowledge  representation  system  that  offers 
a  formally  well  defined  set  of  tools  for  building  an  automated  reasoning  system.  It’s 
emphasis  is  on  flexibility  of  representation,  allowing  the  user  to  decide  if  the  system 
will  basically  operate  as  a  theorem  prover,  a  frame-like  system,  or  an  associative 
network. 

Rhet  offers  two  major  modes  of  inference:  a  horn  clause  theorem  prover 
(backwards  chaining  mechanism),  and  a  forward  chaining  mechanism.  Both  modes 
use  a  common  representation  of  facts,  namely  horn  clauses  with  universally 
quantified,  potentially  type  restricted,  variables,  and  use  the  unification  algorithm. 
Additionally,  they  botn  share  the  following  additional  specialized  reasoning 
capabilities: 

1.  variables  may  be  typed  with  a  fairly  general  type  theory  that  allows  a  limited 
calculus  of  types  including  intersection  and  subtraction; 

2.  full  reasoning  about  equality  between  ground  terms; 

3.  reasoning  within  a  context  space,  with  access  to  axioms  and  terms  in  parent 
contexts; 

4.  the  user  is  allowed  to  specify  specialized  reasoners  for  determining  if  two 
objects  of  a  special  type  will  unify; 

5.  escapes  into  LISP  for  use  as  necessary. 

Rhet  builds  on  some  fairly  recent  work  on  PROLOG  for  efficiency,  including  such 
features  as  ’limited’  intelligent  backtracking. 


Accomplished  in  1987 

By  the  end  of  the  year,  we  had  a  subset  of  the  full  Rhet  functionality  operational. 
In  particular,  we  currently  support: 

•  Reasoning  in  Multiple  Contexts1 . 

•  Unification  between  typed  Rhet  objects,  as  well  as  LISP  objects. 

•  E-unification  (equality)  is  fully  supported  contextually. 

•  A  preliminary  high  level  user  interface  (including  a  window  interface)  has 
been  implemented. 


Currently  this  is  not  completed  at  the  user  interface  level 


•  Semantics  for  structured  type  reasoning  has  been  defined  (this  is  like 
frames). 

•  Both  backward  and  forward  chaining  modes  are  operational. 

•  A  number  of  useful  builtin  functions  have  been  provided. 

•  The  interface  to  the  LISP  interpreter/compiler  has  been  completed. 

•  Constrained  variables  (posting  proofs)  is  fully  supported. 

In  addition  we  will  release  two  TRs  in  January  ’88,  one  the  user  manual  for  the 
system,  and  the  other  lower  level  documentation  for  programmers  who  will  maintain 
or  modify  the  system. 

Expectations  for  1988 

Ta  1988  we  hope  to  also  add: 

•  An  operational  structured  type  subsystem.  This  will  include  modifications  as 
needed  to  the  unifier  to  understand  the  implicit  inverse  relationship  between, 
say,  constructor  functions  and  role  functions.  Implementation  of  this 
subsystem  is  currently  in  progress. 

•  Handle  inequality  reasoning.  This  will  allow  a  user  to  declare  two  objects  to 
be  definitely  not  equal,  so  the  reasoner  will  not  attempt  to  solve  a  proof  by 
assuming  they  are  equal. 

•  Context  operators  at  the  user  interface  level  (e.g.  belief  modal  operators). 

•  Default  slot  values  for  our  structured  types. 

•  Goal  Caching  and  Intelligent  Cache  Flushing,  allowing  a  substantial 
performance  improvement. 

•  Left  Recursion  detection. 

•  Axiom  indexing  (fast  axiom  selection).  Mainly,  a  static  approach,  unlike 
incremental  hashing. 

•  Axiom  declarations  (allows  user  to  specify  even  better  indexing  by  declaring 
variables  to  be  locals,  globals,  downward,  etc.). 

•  Editor  interface  (including  incremental  compilation).  The  current  high  level 
interface  is  incomplete,  and  we  will  want  to  expand  it  as  we  get  more  live 
users. 

•  An  Axiom  Compiler.  This  could  give  roughly  anywhere  from  1  to  2  orders  of 
magnitude  performance  increase,  if  done  properly. 

•  Handle  limited  set  reasoning. 

•  Incremental  hashing  (a  performance  issue  for  KB  lookup). 
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Abstract 

The  literature  on  the  nature  and  representation  of  time 
is  full  of  disputes  and  contradictory  theories.  This  is 
surprising  since  the  nature  of  time  does  not  cause  any 
worry  for  people  in  their  everyday  coping  with  the  world. 
What  this  suggests  is  that  there  is  some  form  of  common 
sense  knowledge  about  time  that  is  rich  enough  to  enable 
people  to  deal  with  the  world,  and  which  is  universal 
enough  to  enable  cooperation  and  communication  between 
people.  In  this  paper,  we  propose  such  a  theory  and 
defend  it  in  two  different  wavs.  We  axiomatic  a  theory  of 
time  in  terms  of  intervals  and  the  single  relation  MFfcT. 
We  then  show  that  this  axiomatization  subsumes  Allen  s 
interval-based  theory.  We  then  extend  the  theory  by 
formally  defining  the  beginnings  and  endings  of  intervals 
and  show  that  these  have  the  properties  we  normally 
would  associate  with  points.  We  distinguish  between  these 
point-like  objects  and  the  concept  of  moment  as 
hypothesized  in  discrete  time  models.  Finally,  we  examine 
the  theory  in  terms  of  each  of  several  different  models. 

Introduction 

The  literature  on  the  nature  and  representation  of  time 
is  full  of  disputes  and  contradictory  theories.  This  is 
surprising  since  the  nature  of  time  does  not  cause  any 
worry  for  people  in  their  everyday  coping  with  the  world. 
What  this  suggests  is  that  there  is  some  form  of  common 
sense  knowledge  about  time  that  is  rich  enough  to  enable 
people  to  deal  with  the  world,  and  which  is  universal 
enough  to  enable  cooperation  and  communication  between 
people.  In  this  paper,  we  propose  such  a  theory  and 
defend  it  in  two  different  ways. 

First,  the  theory  is  powerful  enough  to  include  the 
distinction  between  "intervals”  (i.e.,  times  corresponding 
to  events  with  duration),  and  "points"  (i.e..  times 
corresponding  to  instantaneous  events),  as  well  as  allowing 
substantial  reasoning  about  temporal  ordering  relations 
(including  the  abilities  described  in  (Allen.  I984|).  In 
addition,  it  includes  a  formalization  of  the  beginning  and 
ending  of  events  by  introducing  the  corresponding 
beginning  and  endings  of  times.  We  show  that  beginnings 
and  endings  act  in  many  ways  like  "points,"  yet  can  be 
distinguished  from  them. 


I'his  research  was  supported  in  part  by  ihc  Office  of  Naval  Research 
under  grant  Nfl00l4-8Q-C-197,  the  National  Science  Foundation 
under  grant  l)CR-8J5lh«5  and  the  US  Army  under  grant  IMAK2I- 
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Second,  the  theory  has  as  allowable  models  a  number 
of  the  temporal  models  that  are  suggested  in  the  literature. 
This  includes  models  that  equate  time  with  intervals  and 
points  on  the  real  number  line,  models  that  hypothesize 
discrete  time,  and  any  model  which  mixes  real  points  and 
intervals.  Our  claim  is  that  if  our  common-sense  theory  of 
time  excluded  any  one  of  these  models,  then  there  would 
be  no  debate  as  to  whether  that  model  was  valid,  since  in 
that  case  our  own  primitive  intuitions  on  the  matter  would 
be  extremely  clear.  We  do  make  one  restriction  on  the 
models  considered:  they  must  allow  the  possibility  that 
two  intervals  MEET,  which  is  defined  as  the  situation 
where  there  is  no  time  between  the  two  intervals,  and  no 
time  that  the  intervals  share.  The  importance  of  this 
relationship  for  naive  theories  of  (ime  has  been  argued 
elsewhere  (e.g.,  (Allen.  1983;  1984]).  and  so  will  not  be 
defended  again  here.  Even  with  this  requirement,  we  shall 
see  that  substantially  different  models  are  possible. 

One  important  intuition  which  guides  us  is  that  time  is 
occupied  by  events.  If  the  universe  did  not  change,  there 
would  be  no  time.  Any  sort  of  event  or  happering  which 
can  be  described  or  thought  of  has  a  corresponting  time, 
and  the  universe  of  times  consists  of  these.  We  v  ||  often 
appeal  to  this  intuition,  which  notoriously  sc  i  etimes 
indicates  continuity  and  sometimes  discreients: .  (In 
particular,  it  is  the  source  of  the  need  to  allo\  time 
intervals  to  be  able  to  MEET.) 

In  Section  I.  we  axiomatize  a  theory  of  time  in  terms 
of  intervals  and  the  single  relation  MEETS.  It  is  then 
shown  in  Section  II  that  this  axiomatization  subsumes  the 
interval-based  theory  proposed  in  (Allen,  1983:  1984], 

We  then  extend  the  theory  in  Section  III  by  formally 
defining  the  beginnings  and  endings  of  intervals  and  show 
that  these  have  the  properties  we  normally  would  associate 
with  points.  In  Section  IV,  a  distinction  is  made  between 
these  point-like  objects  and  the  concept  of  moment  as 
hypothesized  in  discrete  time  models.  Finally,  in  Section 
V,  we  examine  the  theory  in  terms  of  each  of  several 
different  models. 

This  paper  is  a  condensed  version  of  a  report.  (Allen  & 
Hayes.  1985|,  henceforth  referred  to  as  the  longer  paper . 
which  presents  additional  discussion  and  the  proofs  for  all 
the  theorems  below. 
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Relation 

Inverse 

Definition 

BEFORE.  b 

AFTER,  a 

EXISTS  k  .  1  MHFriS  k  MEETS  J 

3 

= 

EXISTS  k.j .  k  MEETS  1  MEETS  1  A 
k  MEETS  J  MEET  S  1 

OVERLAPS,  o 

OVERlj\PPKD-BY.  oi 

EXISTS  a.b.c.d.e .  a  MEETS  1  MEETS  d  MEETS  c  Sc 
a  MEETS  b  MEETS  J  MEET  S  c  Sc 
b  MEETS  c  MEETS  d 

STARTS,  s 

SrARTF.D-BY.  si 

EXISTS  a.b.e  .  a  MEETS  1  MEETS  b  MEETS  c  Sc 
a  MEETS  J  MEETS  c 

FINISHES,  f 

FINISHF.D-BY.  ft 

EXISTS  a.b.e  a  MEETS  b  MEETS  1  MEETS  c  & 
a  MEETS  J  MEETS  c 

DURING,  d 

CONTAINS,  di 

EXISTS  a.b.c.d .  a  MEETS  b  MEET S  1  MEETS  c  MEETS  d  Sc 
a  MEEIS  i  MEETS  d 

Figure  1:  The  Relationships  Between  I  and  I  in  terms  of  the  MEETS  Relation 


f.  An  Axiomatization  of  Interval  Time 

We  start  the  formal  development  by  positing  a  class  of 
objects  in  our  ontology  that  we  shall  call  TIMES.  These 
are  intended  to  correspond  to  our  intuitive  notion  of  when 
some  event  occurs.  We  do  not.  at  this  early  stage,  make 
any  committment  as  to  whether  all  times  are 
decomposable  or  not 

The  essential  requirement  of  our  intuition  above  is  that 
two  time  intervals  can  MEET.  We  will  take  MEET  as  our 
primitive  relationship  between  times  and  show  that  we  can 
constructively  define  the  complete  set  of  possible 
relationships  between  intervals  in  terms  of  MEETS.  Other 
reductions  to  a  small  set  are  possible;  for  example, 
Hamblin  [1972]  uses  a  relation  we  could  define  as  less* 
than-or- MEETS.) 

For  example,  we  can  define  a  relationship  BEFORE  to 
hold  between  intervals  only  if  there  exists  an  interval  that 
spans  some  time  between  them.  Thus 


I  BEFORE  J<=  =>  EXISTS  k .  I  MEETS  k A 

k  MEETS  J. 

As  a  notational  convenience,  we  shall  abbreviate 
conjunctions  such  as  the  above  into  a  chain,  i.e.,  I  MEETS 
k  MEETS  J.  We  shall  also  use  the  abbreviations  used  in 
[Allen,  1983]  for  disjunctions  between  pairs  of  intervals. 
Thus  "J  (o  oi  s  f  d)  I"  is  shorthand  for  the  formula 

(J  OVERLAPS  I)  OR  (J  OVERLAPPED-BY  I) 

OR  (J  STARTS  I)  OR  (J  FINISHES  I) 

OR  (J  DURING  I). 

All  the  possible  relationships  between  times  are 
defined  in  Figure  l.  By  including  the  inverses  of  these 
relations  in  the  obvious  way.  we  have  thirteen  relationships 
defined  constructively  in  terms  of  MEET.  Each  entry 


defines  the  ordered  relation  between  I  and  J  (I  BEFORE 
J.  1  OVERLAPS  J.  etc.).  The  inverses  are  also  between  I 
and  J  and  are  equivalent  to  the  original  relationship 
between  J  and  I  (e.g..  I  BEFORE  J  <  =  =  >  J  AFTER  I, 
etc.).  The  small  letters  listed  with  each  give  the 
abbreviation  for  the  relation  that  will  be  used  later  in  some 
examples. 

With  this  reduction,  we  can  axiomatize  the  interval 
logic  entirely  in  terms  of  the  MEETS  relation,  as  follows. 

The  first  two  axioms  are  based  on  the  intuition  that 
intervals  have  a  unique  beginning  position  and  a  unique 
ending  position.  As  a  consequence  of  this,  if  two  intervals 
both  meet  a  third  interval,  then  any  interval  that  one 
meets,  the  other  meets  as  well. 

Axioms  for  Uniqueness  of  "Meeting  Places": 

(Ml)  ALLij. 

(EXISTS  k  .  I  MEETS  k  &  J  MEETS  k)  => 
(ALLI.  I  MEETS  1<  =  >  J  MEETS  1) 

(M2)  ALL  ij . 

(EXISTS k.k  MEETS  lAk  MEETS  i)  => 
(ALLI .  I  MEETS  I  <  =  >  i  MEETS  J) 

The  third  axiom  captures  the  notion  of  ordering.  It 
simply  states  that  given  two  "places”  where  two  intervals 
meet,  then  these  places  are  either  equal  or  one  precedes 
the  other.  This  is  axiomatized  without  referring  to  places  as 
follows: 

Ordering  Axiom: 

(M3)  ALL  ij.k.l . 

(i  MEETS  jAk  MEETS  I)  => 

1)  (i  MEETS  l)XOR 

2)  (EXISTS  m.iM  PETS  m  MEETS  I)  XOR 

3)  (EXISTS  n.k  MEETS  n  MEETS  j) 
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In  other  words,  we  have  exactly  three  possible  cases, 
shown  in  Figure  2,  for  any  four  intervals  i.  j.  k.  and  I. 

i  I  j  »  I  j  i  I  j 

It  I  l  I  m  |  I  n  | 

k  |  I  k  |  1 

Case  1  Case  2  Case  3 

Figure  2:  The  Three  Possible  Orderings 
of  i.  j.  k.  and  l  in  Axiom  M3 

Finally,  we  need  some  existence  axioms.  First,  given 
any  interval,  there  exists  an  interval  that  meets  it,  and  an 
interval  that  it  meets,  i.e.. 

(M4)  ALL  i  EXISTS  j.  k .  j  MEETS  i  MEETS  k 

i  e..  j  |  i  |  k 

A  consequence  of  this  axiom  is  that  no  infinite  time 
intervals  are  allowed  in  our  theory. 

We  need  one  more  existence  axiom,  guaranteeing  the 
existence  of  an  interval  which  is  the  "union"  or  sum  (+) 
of  two  adjacent  intervals,  defined  by: 

(M5)  ALL  U  .  I  MEETS  J  => 

EXISTS  a.b.(l+J)  . 

a  MEETS  I  MEETS  J  MEETS  b 
&  a  MEETS  (I  +  J)  MEETS  b 

i.e.,  |  a  |  l  |  J  |  b  | 

I  l  +  J  I 

Using  the  defined  relations  above,  this  axiom  can  be 
restated  as 

ALL  U  .  I  MEETS  J  =>  EXISTS  (l  +  J)  such  that 

I  STARTS  (l  +  J)  & 
J  FINISHES  (l  +  J) 

We  can  prove  that  when  l  +  J  exists  it  is  unique,  and  that 
+  is  associative. 

With  these  five  axioms  and  the  definitions  given  in  figure 

I.  the  entire  transitivity  table  for  interal  relationships  given 
in  [Allen,  1983]  can  be  derived.  Thus,  this  set  of  five 
axioms  concisely  captures  that  logic.  This  is  not  to  say, 
however,  that  an  implementation  should  not  use  the 
expanded  set  of  relations.  There  are  some  important 
efficiency  gains  from  the  larger  set  of  primitives,  as 
described  in  [Allen,  1983). 

II.  Nests:  Beginnings  and  Endings 

There  are  classes  of  events  described  in  English  that 
cannot  be  associated  with  a  temporal  duration.  These  are 
often  called  "instantaneous"  events,  or  "accomplishments" 
(e.g.,  [Mourelatos.  1978)).  Thus,  we  can  say  "I  closed  the 
door,"  but  if  we  say  "I  closed  the  door  for  three  hours,"  it 
means  we  are  repeatedly  performing  the  action  (contrast  "1 
sat  on  the  floor.").  Similarly,  a  click,  or  the  flash  of  a 


strobe,  cannot  be  qualified  by  a  duration.  Furthermore, 
the  world  after  a  click,  or  flash,  could  be  essentially  the 
same  as  before  it.  showing  that  these  events  cannot  be 
identified  with  simple  changes  of  state. 

One  common  approach  to  handling  the  times  for  such 
events  is  to  model  them  as  points  (real  points  in  the 
continuous  model:  integers  in  the  discrete  model).  In  this 
section  and  the  following  one,  we  shall  develop  two 
distinct  notions  of  points  from  our  interval  logic.  These 
will  be  compared  in  the  final  section. 

In  this  section  we  shall  construct  the  equivalent  of 
points  within  the  interval  logic  defined  in  Section  II  by 
adopting  a  variant  of  filters,  one  of  the  standard 
mathematical  constructions  of  points  from  intervals. 

In  particular,  we  define  the  beginning  of  an  interval  to 
be  the  set  of  all  intervals  that  "touch  the  beginning"  in  any 
way.  and  the  end  similarly.  We  can  define  the  ending  of  an 
interval  similarly. 

BEGfN(I)  =  {  p  |  p  (o  s  m  ft  di  e  si)  I[ 

This  can  be  defined  solelv  in  terms  of  the  MEETS  relation 
if  one  desires,  but  the  above  definition  is  simplest  to 
understand.  For  convenience,  we  can  define  a  nest  as  a 
beginning  or  an  ending,  and  can  now  define  relations  over 
the  set  of  nests  which  show  them  to  have  the  properties  of 
points.  We  shall  say  a  nest  N  is  before  a  nest  M  iff  there  is 
at  least  one  interval  in  N  that  is  before  some  interval  in  M. 

for  any  two  NESTS.  N  and  M 

N  <  M  <=  =>  EXISTS  n.  m  .n€N&m€M& 

n  <  m 

We  show  in  the  longer  paper  that  nests  have  the  important 
properties  of  points.  The  main  result  is  that  nests  are 
totally  ordered.  i.e.. 

Theorem  8:  For  any  two  nests  N  and  M, 

either  N  <  M.  M  <  N  or  N  =  M 

We  can  also  show  that  the  intuitive  definitions  of  the 
interval  relations  in  terms  of  nests  are  theorems.  For 
example,  we  have 

BEGIN(I)  <  ENCHO 

I  MEETS  J  <==>  ENCHO  =  BEGIN(J) 

I  OVERLAPS  J  <==>  BEGIN(I)  <  BEGIN(J)  & 

BEGIN(J)  <  ENCHO  & 
END(0  <  ENCHJ) 

The  second  of  these  is  especially  important  as  it  shows 
that  there  is  only  one  "place"  where  two  meeting  intervals 
actually  meet  This  is,  perhaps  surprisingly,  a  delicate 
matter.  Very  small  changes  in  the  definitions  of  BEGIN 
and  END  fail  to  achieve  this.  It  is  perilously  easy  to  get  a 
point  structure,  which  distinguishes  two  "sides"  of  a  single 
point  and  other  oddities,  as  discussed  in  [Van  Benthem. 
1982).  (We  are  grateful  to  Professor  Dana  Scort  for 
bringing  this  and  Hamblin's  work  to  our  attention,  and 
emphasizing  some  of  these  subtleties.)  We  discuss  this  at 
greater  length  in  the  longer  paper. 
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IV.  Discrete  Time  and  Time  Points 

We  can  now  show  that  discrete  time  models  introduce 
a  different  kind  of  "point"  than  the  points  that  are  defined 
above.  In  particular,  discrete  time  hypothesizes  times  that 
are  not  decomposable.  Let  us  introduce  a  distinction 
between  true- intervals  and  moments  as  follows: 

ALL  I  .  TRUE-INTERVAL(I)  <  =  > 

EXISTS  a.b.c.d  .  a  MEETS  I  MEETS  d 
&  a  MEETS  b  MEETS  c  MEETS  d 

ALL  I  .  MOMENTO)  <  =  >  -TRUE-INTERVAL(l) 

Thus,  a  true-interval  has  at  least  two  sub-intervals  (which 
might  in  turn  be  moments  or  true-intervals)--one  that 
STARTS  it  and  one  that  FINISHES. 

Before  we  continue,  it  is  important  to  remember  that 
all  of  the  earlier  theorems  were  proven  before  any 
distinction  was  made  between  moments  and  true-intervals. 
so  they  all  hold  for  both  classes:  none  of  the  proofs  ever 
depended  on  the  decomposabiiity  of  an  interval.  These 
definitions  allow  us  to  prove  that  two  moments  cannot 
overlap  in  any  way.  yet  they  can  MEET  each  other.  More 
precisely. 

ALL  IJ  .  MOMENTfl)  &  MOMENTfl)  => 

I  «  m  =  mi  >)  J 

Let  us  now  consider  the  relationship  between  nests  and 
moments.  The  definition  of  nests  did  not  exclude  nests 
defined  at  the  beginning  or  ending  of  moments.  In  fact,  we 
can  show  that  the  beginning  of  a  moment  is  before  the 
ending  of  that  same  moment!  Thus,  although  a  moment 
cannot  be  decomposed,  we  can  distinguish  its  beginning 
from  its  ending. 

We  can  also  show  that  moments  and  nests  cannot  be 
considered  to  be  isomorphic  to  each  other.  This  is  easily 
seen  from  the  observation  that  moments  can  MEET  each 
other,  whereas  nests  cannot.  Intuitively,  a  moment  is  a 
time  during  which  some  event  (a  flash,  a  bang)  occurs, 
while  a  nest  defines  an  abstract  "position"  in  the  sequence 
of  times. 

V.  Discussion 

It  is  interesting  to  interpret  these  axioms  in  various 
possible  models.  The  simplest  one  is  discrete  time: 
intervals  are  pairs  of  integers  <n,m>  with  n  <  m.  and  <n,m> 
MEETS  <m,k>.  Then  a  moment  is  a  nondecomposable 
interval  <n.n+l>,  and  nests  pick  out  integers,  the  places 
"between"  moments.  In  this  model  there  is  a  clear 
distinction  between  moments  and  points.  We  can  also 
define  several  models  based  on  the  real  line.  For  example, 
time  intervals  can  be  mapped  into  open  or  closed  real 
intervals:  however,  then  times  can  never  MEET.  A  simpler 
continuous  model,  based  on  the  integer  model  above. 


defines  time  intervals  as  pairs  of  reals  <a.b>.  with  <a.b> 
MEETS  <b.c>.  Following  through  the  axiomatic 
definitions  with  this  as  a  basis  makes  nests  define  points  on 
the  real  line,  as  expected,  but  now  there  are  no  moments  at 
alt.  since  even  the  smallest  interval  is  decomposable.  We 
might  try  to  extend  the  model  to  allow  intervals  of  the 
form  <a.a>.  which  would  qualify  as  moments,  but  now 
consider  <a.b>.  <b.b>  and  <b.c>.  By  our  definitions,  the 
first  MEETs  the  last  yet  they  have  the  second  between 
them,  so  the  first  is  BEFORE  the  last,  violating  the 
ordering  axiom.  We  have  tried  to  fit  real,  substantial- 
though  very  small-time  intervals  into  merely  mathematical 
"places."  and  they  don't  lit. 

However,  another  possible  model  is  one  which  mixes 
these,  using  the  same  definitions  of  interval  and  MEET 
(from  which  all  else  follows)  but  allowing  parts  of  the  time 
line  to  be  discrete  and  parts  to  be  continuous.  Intuitively, 
if  we  have  only  coarse  time  measuring  tools  available,  then 
we  can  treat  time  as  discrete,  but  the  possibility  always 
remains  of  turning  up  the  temporal  magnification 
arbitrarily  far.  if  we  have  access  to  events  which  can  make 
the  finer  distinctions,  distinctions  which  can  split 
"moments"  into  smaller  and  smaller  parts. 

Our  axiomatic  theory  allows  all  of  these  models  and 
others:  it  is  uncommitted  as  to  continuity  or  discreteness  of 
the  sequence  of  times,  yet  is  powerful  enough  to  support  a 
great  deal  of  the  temporal  reasoning  of  common  sense. 
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Abstract 

The  quantity  of  resources  that  an  agent  expends  in 
solving  problems  in  a  given  domain  is  determined  by 
the  representations  and  search  control  strategies  that 
it  employs.  The  value  of  individual  representations  or 
strategies  to  the  agent  is  determined  by  their 
contribution  to  the  resource  expenditure.  We  argue 
here  that  in  order  to  choose  the  component 
representations  and  strategies  appropriate  for  a 
particular  problem  domain  it  is  necessary  to  measure 
theircontribution  to  the  resource  esuemnture  on  the 
actual  problems  the  agent  faces.  Hits  is  as  true  for  a 
system  designer  making  such  choices  as  it  is  for  an 
autonomoua  mechanical  agent.  We  present  one  way  to 
measure  this  contribution  and  give  an  example  in 
which  the  measure  is  used  to  improve  problem  solving 
performance. 

I  Introduction 

A  primary  goal  of  Artificial  Intelligence  research  is  to 
enable  automated  agents  to  have  general  reasoning 
abilities  over  a  wide  range  of  domains.  Due  to  the 
inherent  computational  complexity  of  this  task,  system 
designers  make  performance  choices  that  trade 
generality  for  improved  resource  use.  Such 
performance  choices  are  necessary  for  any  agent  with 
limited  amounts  of  space  and  time  to  be  effective 
within  a  dynamic  world.  Most  running  AI  systems 
have  embedded  within  them  the  choices  that  their 
designers  felt  would  maximise  their  usefulness. 
Unfortunately,  such  choices  rely  on  hidden 
assumptions,  such  as  what  variety  and  frequency  of 
problems  will  be  encountered,  or  now  correct  or  close  to 
optimal  the  eventual  solution  should  be.  Often,  the 
designers  themselves  may  not  know  what  these 
assumptions  are,  since  they  are  further  obecured,  for 
example,  by  choices  implicit  in  the  implementation  of 
the  representation  language,  or  by  insufficient  prior 
information  on  the  range  of  problems  that  the  system 
may  encounter.  Unfortunately,  this  makes  it  difficult 
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for  other  researchers  to  determine  if  the  tradeoffs  are 
appropriate  for  their  problem  domain. 

In  this  paper,  we  argue  that  performance  choices 
should  be  made  explicit  in  order  to  establish  measures 
with  which  different  choices  can  be  compared.  Further, 
we  provide  an  example  of  how  this  might  be  done  by 
developing  a  cost  metric  defined  over  search  control 
strategies.  The  short-  term  benefit  of  this  approach  is 
that  system  designers  will  have  a  formalism  which  can 
be  applied  in  determining  preferred  performance 
choices  for  the  problems  they  are  solving.  The  long¬ 
term  benefit  is  that  an  automated  agent  will  have  the 
ability  to  evaluate  its  success  in  satisfying  its  goals  in 
comparison  with  other  candidate  control  strategies. 

2  Intractability 

Intelligent  agents  perform  computations  upon  an 
internally  stored  representation  of  the  world. 

Examples  in  [Levesque  and  Brachman,  1985]  show  how 
the  choice  of  representation  language  bears  ou  the 
computational  complexity  of  solving  problems  using 
that  language.  Aa  they  point  out,  reasoning  systems 
"will  either  be  limited  in  what  knowledge  they  can 
represent,  or  unlimited  in  the  reasoning  effort  they 
might  require."  For  example,  if  we  use  a  first-order 
predicate  representation  language,  then  there  exists  no 
algorithm  that  will  decide  for  any  theory  T and 
sentence  S  encoded  in  this  language  whether  S  is  a 
theorem  of  T.  However,  it  might  be  the  case  that  a 
particular  theory  T  in  which  we  are  interested  can  be 
encoded  within  a  weaker  representation,  such  as  finite 
automata.  Then  there  exist  algorithms  of  bounded 
computational  complexity  that  will  answer  most 
questions  with  regard  to  this  theory,  such  as  if  a  string 
is  accepted  by  the  automaton.  Unfortunately,  no 
algorithm  exists  that  decides  whether  some  first-order 
theory  can  also  be  expressed  in  a  weaker 
representation  language. 

We  wifi  say  that  a  problem  P  is  intractable  if  there  is 
no  expression  language  L  in  which  to  state  problem 
instances  of  P  such  that  there  is  a  deterministic  Turing 
machine  that  can  compute  solutions  to  such  instances 
within  polynomial  time  and  space.  The  following,  a 
version  of  the  travelling  salesman  problem,  is  an 
example  of  a  problem  believed  to  be  intractable. 
Imagine  that  an  agent  finds  itself  in  a  city  and  is  given 
a  set  of  n  buildings  to  visit,  along  with  information  as 
to  the  distance  between  each  pair  of  buidings.  The 
question  to  determine  is  whether  the  agent  can  visit 
each  building  exactly  once  on  a  path  no  more  than  k 
unite  long.  It  is  possible  that  the  agent  can  answer  this 
question  quickly  for  some  given  set  of  parameters,  but 
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it  it  believed  that  there  is  no  deterministic  algorithm 
(serial  or  parallel)  that  will  solve  every  possible 
instance  of  this  problem  within  an  amount  of  time 
polynomial  in  the  number  of  buildings  in  the  problem. 
Note  that  this  is  not  a  function  of  the  representation, 
but  of  the  problem,  since  it  is  believed  that  there  exists 
no  representation  for  which  this  problem  is  tractable. 
Many  problems  that  agents  are  called  upon  to  solve  are 
intractable. 


The  fact  that  an  agent  will  encounter  difTicult  problems 
does  not  mean  that  the  agent  should  make  no  attempt 
to  solve  them.  But  in  order  to  perform  effectively,  an 
agent  must  reason  not  about  whether  it  can  solve  all 
possible  problems  optimally  but,  rather,  how  it  can 
maximise  resource  use  over  the  entire  sample  of 

Coblems  that  it  will  encounter.  In  order  to  do  this,  we 
lieve  that  defining  notions  of  problem  class  and 
resource  cost  inessential. 

3  Problem  Classes 


Standard  complexity  theory  typically  defines  a 
problem  class  in  terms  of  the  worst-case  time  or  space 
Behavior  of  a  particular  computational  model,  as  a 
function  of  the  size  of  the  input  problem.  For  example, 
the  travelling  salesman  problem  mentioned  above  falls 
in  the  class  NP-TIME,  which  means  that  there  is  a 
non-deterministic  Turing  machine  that  solves  every 
instance  of  a  travelling  salesman  problem  within  time 
polynomial  in  the  size  of  the  input  The  fastest  known 
deterministic  Turing  machines  for  these  problems  take 
at  least  time  exponential  in  the  size  of  the  input 
Unfortunately,  it  is  not  known  whether  there  are  faster 
deterministic  algorithms.  It  is  in  fact  quite  difficult  to 
determine  what  is  the  fastest  algorithm  one  can 
construct  for  a  given  problem. 


There  are  several  aspects  of  this  definition  of  class 
which  are  inadequate  for  the  task  of  optimizing  an 
agent's  use  of  its  resources.  First,  each  input  problem 
instance  to  an  algorithm  is  assumed  to  be  in  the  correct 
form  so  that  all  operations  of  the  algorithm  are  defined. 
For  example,  a  typical  sorting  algorithm  requires  a  list 
of  elements  from  a  set  as  input.  The  interpretation  of 
this  data  structure  is  implicitly  defined  by  the 
algorithm.  Intelligent  agents,  however,  are  required  to 
determine  from  general  representations  of  its  input 
which  specific  type  of  problem  its  input  is  an  instance 
of.  For  example,  if  an  agent  has  the  goal  of  ordering  a 
set  of  objects,  it  must  recognize  that  this  is  a  sorting 
problem  and  that  there  are  algorithms  for  solving  such 
problems.  The  difficulty  of  this  recognition  task  will  be 
a  property  of  the  language  in  which  sorting  problems 
are  expressed. 


Second,  wont-case  behavior  is  often  too  pessimistic.  In 
practice  the  worst  case  may  occur  infrequently.  Third, 
there  might  be  restrictions  one  can  place  on  the  input 
that  allow  a  subset  of  the  problems  to  be  solved  quickly. 
For  example,  in  the  travelling  salesman  problem,  if  the 
cities  are  known  to  be  collinear,  then  the  solution  can 
be  trivially  generated.  Typically,  general  algorithms 
do  not  attempt  to  determine  if  the  given  input  falls 
within  one  of  the  easier  restricted  subsets. 


We  will  consider  a  much  weaker  notion  of  problem 
class.  In  general,  an  algorithm  will  define  a  problem 
class,  in  that  a  problem  instance  is  a  member  of  the 


class  if  the  algorithm  solves  that  instance.We  can 
assume  that  any  complex  agent  will  be  called  upon  to 
solve  a  variety  of  tasks,  some  as  easy  as  finding  the 
largest  element  from  a  set,  and  some  at  least  as 
difficult  as  the  travelling  salesman  problem.  If  a 
domain  is  encoded  using  a  general  representation  for 
which  there  exists  an  algorithm  capable  of  solving  each 
problem  expressible  in  this  representation,  then  we  can 
consider  an  agent  using  this  algorithm  as  solving 
problems  from  a  single  class.  That  is,  every  problem 
that  the  agent  encounters  falls  within  the  large 
problem  class  defined  by  the  algorithm. 

Many  algorithms  fail  to  distinguish  between  tasks  of 
various  difficulties,  as  was  noted  earlier  with  the 
travelling  salesman  problem.  Due  to  the  general 
nature  of  the  algorithms,  a  high  computational  cost  is 
incurred  when  solving  both  the  easy  and  the  hard 
tasks.  Taking  a  more  general  example,  suppose  an 
agent’s  domain  is  represented  in  the  first  order 
predicate  calculus,  and  the  agent  is  able  to  solve  each 
problem  it  is  presented,  i.e.,  each  theorem  to  be  proven, 
using  a  complete  proof  procedure.  Unfortunately,  thex 
proof  procedures  take  greater  than  exponential  time  in 
the  length  of  the  input  to  generate  a  proof  in  the 
general  case,  if  a  proof  exists.  Assuming  that  some  of 
the  input  problems  involve  finding  the  largest  element 
from  a  set,  the  agent  is  expending  considerably  more 
resource  than  necessary  to  solve  these  problems  using 
the  general  proof  procedure.  That  is,  if  the  agent  were 
able  to  distinguish  these  problems  at  low  cost,  then  the 
agent  could  use  one  of  the  known  polynomial  time 
algorithms  for  their  solution.  This  fact  motivates  the 
use  of,  for  example,  procedural  attachment  in  theorem 
proving  [Nilsson,  1980]. 

An  agent,  then,  can  be  provided  with  a  repertoire  of 
algorithms,  each  one  optimized  to  a  particular  set  of 
problems.  The  agent  can  be  considered  to  encounter 
problems  from  a  variety  of  classes  -  one  problem  class 
for  each  algorithm.  Unlike  the  standard  complexity- 
theoretic  model  where  the  algorithms  are  never 
required  to  distinguish  between  the  various  restricted 
subcases,  we  assume  that  complex  agents  will  be 
obliged  to  do  so.  There  will  therefore  be  a  cost 
associated  with  determining  into  which  class  a  given 
input  problem  falls,  i.e,  which  algorithm  should  be 
used  to  solve  the  problem.  There  might  thus  be  no 
computational  advantage  to  an  agent  in  having  a  large 
repertoire  of  algorithms.  The  usefulness  of  a  member 
algorithm  will  be  a  function  both  of  the  frequency  with 
which  problems  falling  within  this  algorithm’s  class 
are  encountered  by  the  agent,  and  of  tne  comparative 
costs  of  solving  problems  using  this  algorithm  versus 
other  candidate  algorithms.  By  considering  the  entire 
set  of  encountered  problems  as  falling  into  a  variety  of 
problem  classes,  the  statistical  properties  of  the 
various  classes  can  be  exploited  to  improve 
performance.  A  faster  running  time  of  an  algorithm 
whose  class  occurs  with  high  relative  frequency  will 
improve  the  overall  performance  of  the  agent  The 
longer  a  period  of  time  over  which  an  agent  solves 
problems,  the  greater  will  be  the  gains  from  the 
algorithm  optimization,  assuming  the  relative 
frequencies  remain  invariant.  Hence  such  sustained 
problem  solving  activity  justifies  the  expenditure  of 
resource  to  perform  the  optimization. 
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4  Coats 


Thera  are  few  precedents  in  A1  that  involve  measuring 
coats.  One  example  is  the  A*  algorithm  [Nilsson, 

1980].  A*  is  an  algorithm  for  searching  a  state  space, 
described  by  an  initial  state,  a  goal  state,  and  the  legal 
transitions  between  states,  which  computes  a  path 
between  the  initial  and  goal  states.  This  algorithm  has 
the  characteristic  that  it  is  admitsable,  meaning  that 
the  first  path  it  finds  that  solves  the  problem  will  be  of 
minimum  length,  if  any  solution  path  exists.  The 
algorithm  works  by  searching  through  the  current 
least-coat  transition,  when  cost  is  a  function  of  the 
number  of  transitions  already  taken  on  that  path,  and 
an  estimate  of  the  number  of  transitions  required  to 
reach  the  goal  along  that  path.  This  cost,  therefore, 
only  measures  the  length  of  the  solutipn.  A*  makes  no 
attempt  to  model  the  resources  required  to  arrive  at  a 
state,  which  includes  the  resources  expended  on  the 
other  incomplete  or  failed  attempts  up  to  that  point  in 
the  computation.  It  is  just  such  a  measure  of  total 
computational  costs  that  we  are  attempting  to  model. 
This  replaces  search  for  minimal  length  solutions  with 
search  for  minimal  resource  use  solutions. 

A*  places  a  premium  on  the  optimality  and  correctness 
of  every  solution  it  reaches,  which  makes  it  generally 
unsuited  for  the  task  of  searching  through  complex 
domains.  It  will  typically  be  cost-effective  for  an  agent 
to  trade  some  degree  of  minimality,  completeness  and 
correctness  of  the  solution  for  a  speed-up  in  the  time 
required  to  compute  it  For  example,  several 
polynomial  time  algorithms  have  been  constructed 
that  find  sub-optimal  solutions  to  NP-complete 
problems  [Garay  and  Johnson,  1979].  Many  of  these 
algorithms  are  such  that  the  larger  the  upper  bound  on 
the  acceptable  distance  between  an  approximate 
solution  and  the  optimal  solution,  the  faster  the 
algorithm  returns  one  of  these  approximate  solutions. 

Augmenting  the  symbolic  approach  of  AI  by  decision 
theoretic  techniques  is  suggested  in  [Feldman  and 
Sproull,  1977].  It  is  argued  there  that  for  many 
problema  in  AI  there  are  natural  numeric  measures  of 
cost  and  that  it  is  only  on  the  basis  of  such  measures 
that  good  decisions  can  be  made.  That  is,  a  decision 
theoretic  form  is  an  appropriate  representation  for 
certain  problems.  In  the  search  for  a  solution  to  a 
problem  instance,  the  paper  discusses  how  to  select 
Between  alternate  plans,  how  to  deal  with  the 
uncertainty  of  the  outcome  of  plan  steps  and  how  to 
introduce  knowledge  producing  actions  into  plans.  The 
basic  mechanism  is  to  compute  the  expected  utility  of  a 

groblem  instance’s  candidate  solutions.  Our  goal  here 
i  to  investigate  the  expected  utility,  or  more 
specifically,  the  expected  computational  expenditure, 
of  a  problem  solving  strategy  over  an  agent's  entire 
sample  of  problem  instances.  Similar  to  choosing,  from 
a  set  of  candiates,  a  solution  that  maximises  expected 
utility,  we  intend  to  choose  from  a  set  of  candidate 
problem  solving  strategies  one  that  maximizes 
expected  computational  cost 

Also  discussed  in  [Feldman  and  Sproull  1977]  is  the 
utility  of  additional  planning.  If  an  agent  possesses  a 
solution  to  a  problem  then  in  order  for  further  planning 
effort  to  be  profitable,  the  improvements  in  the  solution 
must  offset  the  additional  coat  In  a  similar  way  the 


measurements  and  computation  required  to  determine 
the  expected  computational  cost  of  a  strategy  must  be 
offset  by  the  performance  improvement  that  results 
from  this  improvement 

As  mentioned  earlier,  finding  lower  bounds  proofs  is 
difficult  for  people,  and  will  certainly  be  so  for 
automated  agents.  This  meaas  that  most  agents  will 
rarely  be  able  to  prove  that  they  are  doing  the  best 
possible.  Therefore,  it  will  usually  be  in  the  interests  of 
an  agent  (and  its  designer)  to  express  its  performance 
choices  in  such  a  way  that  measuring  the  efficacy  of 
these  choices  is  possible.  This  requires  considering  all 
resource  expenditure  as  cost  and  expected 
improvements  of  one  strategy  over  another  as  benefits. 
We  claim  that  for  agents  who  are  called  upon  to  make 
decisions  in  domains  containing  intractable  problems, 
performance  choices  can  only  be  made  with  respect  to 
the  sample  of  problem  instances  that  an  agent 
encounters.  Whereas  a  particular  performance  choice 
may  work  quite  well  for  a  given  problem  sample,  it 
may  give  equally  poor  performance  for  another  sample. 
That  is,  agents  cannot  solve  all  problem  instances 
optimally  within  such  domains.  By  making  the 
performance  choices  explicit,  it  is  possible  to  evaluate 
whether  a  given  choice  is  a  good  one  for  a  particular 
sample.  Thus,  one  is  able  to  exploit  any  information 
about  the  sample  of  problems  that  is  believed  will  be 
encountered  in  the  future.  Although  the  properties  of 
future  problems  are  unknown,  predictions  can  be  based 
upon  the  sample  of  problems  that  have  already  been 
seen.  One  of  the  long-term  objectives  of  the  approach 
advocated  here  is  to  state  in  a  domain  independent 
fashion  those  invariant  properties  of  a  particular 
domain  or  sample  of  problems  that  justify  the  belief 
that  one  has  a  good  algorithm  for  the  domain.  Such 
domain  independent  properties  also  provide 
justification  for  applying  the  algorithm  to  other 
domains  that  are  similar  to  the  original  domain  with 
respect  to  these  properties.  Additionally,  future 
performance  improvements  may  also  transfer  across 
these  similar  domains. 

5  Definitions 

This  section  defines  a  cost  metric  for  problem  solving 
that  illustrates  some  of  the  points  raised  above.  This 
metric  corresponds  to  the  expected  cost  per  problem 
that  an  agent  expends.  The  metric  is  based  on  problem 
classes  and  the  relative  frequency  that  a  given  problem 
is  a  member  of  a  particular  class.  The  solution  strategy 
used  by  an  agent  partitions  problems  into  equivalence 
classes.  One  of  these  equivalence  classes  is  intended  to 
correspond  to  a  set  of  problems  of  comparable  difficulty 
or  computational  complexity.  The  cost  metric  makes 
clear  how  classes  of  difficult  problems  increase  the 
expected  cost  and  how  an  agent  can  exploit  the 
presence  of  a  class  of  easy  problems. 

We  take  the  agent  to  consist  of  a  search  strategy.  The 
agent  exists  in  an  environment  in  which  problems  are 
presented  to  it  one  at  a  time.  The  agent’s  response  to 
the  current  problem  is  a  solution  to  that  problem 
obtained  by  the  search  strategy.  We  assume  that  the 
statistical  properties  of  this  sequence  of  problema 
presented  to  the  agent  are  fixed.  Specifically,  we 
assume  that  the  probability  of  members  of  a  problem 
class  being  presented  to  the  agent  are  constant  and 
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that  this  probability  is  asymptotically  approximated  by  if  N  then  there  is  some  x  such  that 

the  relative  frequency  of  the  problem  class.  xiX  and  <ix>lH. 


Let  X  be  a  countable  set  of  problems.  The  problem 
instances  that  are  presented  to  an  agent  are  members 
o(X.  Let  P  be  the  set  of  solutions  that  correspond  to  the 
problems  in  X.  We  intend  that  P  have  a  flexible 
interpretation.  The  members  of  P  may  be,  for  example, 
literal  solutions  as  a  sorted  list  is  the  solution  to  a  sort 
problem.  Alternatively  the  members  of  P  may  be 
algorithms  that  generate  such  literal  solutions. 
However  P  is  interpreted,  the  agent’s  problem-solving 
strategy  selects  members  from  P.  Without  loss  of 
generality,  every  problem  in  X  is  assumed  to  have  a 
solution  in  P.  For  the  present  purposes  we  also  assume 
that  whether  a  member  of  P  solves  a  member  of  X  is 
decidable. 

Let »  be  the  search  function  that  the  agent  uses  to 
generate  solutions  to  problems.  That  is, 

S  :  X-*P 

Given  a  problem  xiX,  six)  is  a  member  of  P  that  is  a 
solution  to  x.  Note  that  there  may  be  several  members 
of  P  that  solve  x  but  each  search  function  selects  only 
one  such  solution.  Corresponding  to  each  search 
function  s  is  a  function  c,  such  that  the  value  c«(x)  is  the 
cost  that  *  expends  to  solve  problem  x. 

C,  :  X-*R 

The  function  ct  is  real-valued  but  may  be  interpreted  as 
the  number  of  units  of  any  arbitrary  resource,  e.g., 
time,  space  or  food. 

The  strategy  a  partitions  the  problems  into  classes  on 
tiie  basis  ofthe  solutions  chosen  by  s.  We  will  consider 
two  problems  equivalent  with  respect  to  s  if  s  returns 
the  same  solution  for  both.  That  is, 

x-,y  iff  «(x)=«(y) 

We  use  [x],  to  denote  the  equivalence  class  of  x  under 

Consider,  for  example,  the  situation  in  which  X  is  a  set 
of  motion  planning  problems,  the  set  of  solutions  P  is  a 
set  of  specialized  motion  planning  algorithms  each  of 
which  generates  a  sequence  of  robot  actions.  When  the 
agent  is  presented  with  problem  x,  namely,  a 
description  of  the  environment  and  the  robot’s  initial 
and  final  state,  s  selects  some  piP,  an  algorithm  that 
yields  a  motion  plan.  The  equivalence  class  of  problem 
x,  (x]„  is  just  the  set  of  problems  for  which  s  selects 
algorithm  p  =  s(x)  to  generate  the  motion  plan. 
Without  placing  further  conditions  on  P,  the  set  of 
problems  for  which  p  generates  correct  motion  plans 
will  in  general  properly  contain  (x]t,  the  set  of  problems 
for  which  s  selects  jp  as  a  solution.  In  general  the  actual 
cost  of  executing  p  is  not  included  in  c«(x)  but  is  part  of 
the  cost  of  obtaining  a  literal  solution  for  x.  This  point 
is  discussed  further  in  $6. 

A  problem  sample  is  the  denumerable  sequence  of 
problem  instances  presented  to  the  agent  We 
represent  a  problem  sample  as  a  set  of  pairs  <ix> 
consisting  of  an  index  i  and  a  problem  instance  x.  If 
<ix>  in,  problem  instance  x  is  the  ith  element  of  the 
sequence.  Ilius,  a  problem  sample  H  is  a  subset  of 
N  X  X  with  the  following  properties: 
if  <*x>,  <iy>  i  H  then  x=y 


A  prefix  Hn  of  a  problem  sample  H  is  just  the  set  of 
problem  instances  up  to  some  point  n  in  the  sequence: 

H"  =  (<ix>|  <ix>(H  &  i*n} 


We  can  extend  the  notion  of  problem  equivalence 
classes  to  sets  of  problem  instances  ana  thus  to  problem 
samples  and  sample  prefixes.  For  problem  x  and  set  of 
problem  instances  H 

[xH],  =  {<jy>  |  <jy>iH  &  y«[x],} 

If  H  is  a  problem  sample  then  (x  H]t  is  the  subset  of  the 
sequence  whose  members  are  in  the  equivalence  class 
of  x  as  determined  by  strategy  s.  That  is,  [x  H],  is  the 
set  of  problem  instances  in  Htor  which  s  returns  the 
same  solution  as  it  returns  for  x.  If  a  particular 
problem  x  appears  more  than  once  in  H,  each  pair  that 
includes  x  is  in  [x  H]t.  Similarly  [x/r*]«  is  the  set  of 
instances  in  H B  of  problems  equivalent  to  x.  We  use 
[H].  to  denote  the  set  of  all  equivalence  classes  of 
problem  instances  of  H  with  respect  to  s: 


[H],  =  «xH],  |  3 i  <ix>iH} 

Similarly  [#"]«  is  the  set  of  all  equivalence  classes  of 
problem  instances  of  Hn  with  respect  to  «. 


Our  cost  evaluation  of  a  strategy  s  for  generating 
solutions  is  based  on  the  above  definition  of 
equivalence  classes  of  problem  instances.  The  cost 
function  for  s  qan  be  extended  in  a  simple  way  to 
members  of  a  problem  sample: 

Cf (<ix>)  =  Cg(x). 

Note  that  for  problems  x,  y  in  the  same  equivalence 
class,  we  will  not  in  general  have  c,(x)  =  cf(y).  Even 
though  s  obtains  the  same  solution  for  x  and  y,  s  may 
expend  more  effort  in  one  case  than  in  another.  Thus, 
for  a  prefix  ff”  the  average  cost  of  solving  a  problem 
instance  in  [x  ff*],,  denoted  by  c,[x  Hn]f,  is  the  sum  of 
the  cost  of  each  individual  problem  divided  by  the  size 
of  [xff"],.  That  is. 


c[ x0*J  *  -  y"  el<iy» 

*  •  * 

where  |A|  denotes  the  cardinality  of  set  A. 


Let  %(A  B)  denote  the  relative  frequency  of  members  of 
setAinsetfi.  For  finite  sets  A  ana  0,  where  A  is  a 
subset  of  0, 


«ASI= 

W 


Then  the  relative  frequency  with  which  problems  in 
[x  appear  in  hP1  is 


*0x0*1,  0") 


fr  0*1,1 

10*1 


I  Cx  0*1,1 


Consider  a  single  problem  class  q€[Hnl*.  That  is,  let  q 
be  a  set  of  equivalent  problem  instances  in  Hn  with 
respect  to  search  strategy  s.  The  part  of  the  average 
cost  incurred  by  s  on  Hr  that  is  attributable  to  q  is  just 
the  product  of  the  average  cost  that  s  expends  on 
problems  in  q  and  the  relative  frequency  of  q  in  fTh 
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eJ^VatT) 

Now  summing  over  ell  such  equivalence  classes  of 
problem  instances,  the  average  cost  incurred  by  s  to 
solve  a  problem  in  H*  is 

C(*T) »  £  ctU|)mqH',) 

Ilcjq)  la  bounded  for  all  problem  classes  q  iaH  then 
from  the  assumption  that  the  statistical  properties  of  H 
are  fixed  we  have  that 

Cm  «  liw 

is  well  defined  and  finite.  This  limit  is  the  expected  cost 
that  s  expends  to  solve  a  problem  in  H .  For  a  particular 
s  by  recording  the  average  cost  and  relative  frequency 
of  problem  equivalence  classes  we  can  evaluate  C,(/fh) 
and  approximate  Cf(H).  Thus  we  have  some 
information  about  what  performance  we  can  expect 
from  <  on  problem  sample  H.  It  is  here  that  die 
invariance  of  problem  class  frequencies  play  a  role.  As 
shown  in  the  next  section  this  performance  measure 
can  be  the  basis  for  comparing  and  evaluating  two 
different  problem  solution  strategies. 

6  Example:  A  hybrid  algorithm 

Having  defined  a  measure  of  computational  cost,  the 
question  arises  as  to  how  it  may  be  used.  The  following 
example  illustrates  the  use  of  this  measure  to  improve 
performance.  A  problem  solution  strategy  is  defined 
such  that,  given  a  problem,  it  selects  an  algorithm 
whose  output  is  the  solution  to  the  problem.  From  a 
family  of  such  strategies  we  show  how  to  choose  the 
optimal  one  according  to  the  performance  measure. 

Consider  a  sequence  of  algorithms  Ai  Ag ...  An,  each 
member  of  which  terminates  on  all  input,  takes  as 
input  an  encoding  of  a  problem  and  yields  as  ouput  a 
solution  to  the  problem  or  an  indication  of  failure.  Let 
the  sequence  have  the  additional  property  that  any 
member  of  the  sequence  solves  at  least  all  of  the 
problems  of  its  predecessor.  We  refer  to  this  property  of 
a  sequence  of  algorithms  as  the  $ub$umption  property. 
A  specific  instance  of  such  a  sequence  of  algorithms  can 
be  constructed  to  address  the  motion  planning  problem 
In  robotics.  The  motion  planning  problem,  e.g., 
[Losano-Pdres,  Wesley,  1979],  is  to  find  a  collision  free 
path  for  an  object  from  an  initial  position  to  a  desired 
final  position.  For  an  object  B,  if  we  have  a  sequence  of 
object  approximations  BtB} ...  B„  -  B  and  a  motion 
planning  algorithm  M ,  we  can  specialize  M  by  each  of 
the  Bj's  to  yield  Afi  Mq ...  Mn  so  that  Mi  computes  paths 
only  for  fij.  That  is,  Af,  computes  paths  for  the  ith 
approximation  of  object  B.  if  Bi  strictly  contains  its 
successor  in  the  sequence  and  if  M  is  suitably  well- 
behaved  then  the  Mi’s  have  the  subsumption  property. 
That  is,  since  Mi  uses  an  approximation  to  the  object 
that  is  bigger  than  that  used  by  Mi +i.  Mi  can  only  find 
a  path  if  Mi  #  j  does  (figure  6.1). 

We  are  appealing  to  the  intuition  that  giving  up  detail 
and  completeness,  i.e.,  the  property  that  a  solution  is 
found  if  one  exists,  enables  us  to  obtain  more  quickly 
the  solutions  we  do  find.  In  ascending  order  the  fi.’s  are 
increasingly  better  approximations  to  the  original 
object  Similarly,  the  Mi’s  are  increasingly  better 


Sequence  of  object  approximation* 


Situations  distinguished  by  object  approximations 


Figure  8. 1  Object  approximations  generate  motion  planning 
algorithms 

An  example  of  a  sequence  of  object  approximations  that  can  be 
used  to  define  a  sequence  of  motion  planning  algorithms  with  the 
subsumption  property. 


approximations  to  a  complete  algorithm  to  find  motion 
plans  for  the  original  object.  In  this  instance  we  may 
profit  from  giving  up  the  precision  of  a  complete 
algorithm  by  being  able  to  use  representations  of  lower 
combinatorial  complexity. 

Now  let  the  search  strategy  s  be  such  that,  given  a 
problem,  s  applies  the  Ai’s  in  turn  until  one  of  the  A,’s 
returns  a  literal  solution  to  the  problem.  Such  a  search 
strategy,  which  we  will  refer  to  as  a  hybrid  algorithm, 
la  unambiguously  specified  by  its  component 
algorithms.  Intuitively  s  has  generality  and  also 
exploits  the  presence  of  simple  problems.  The  strategy 
s  can  be  as  general  as  we  want  by  appending  to  the 
sequence  the  most  general  known  algorithm.  In 
particular,  s  can  be  made  complete  if  i  complete 
algorithm  is  known.  Because  the  simpler  and, 
presumably,  faster  approximation  algorithms  are 
applied  first,  s  solves  easy  problems  quickly.  To  some 
degree  a  hybrid  algorithm  invests  an  appropriate 
amount  of  effort  in  solvings  problem  instance. 

In  the  interests  of  applying  the  definitions  of  $5  we 
interpret »  as  selecting  one  of  the  A,’s.  That  is,  the 
solution  that  s  returns  is  a  member  of  {At  A2 . .  An}. 
The  A  j  selected  by  s  is  one  that  can  be  executed  to 
obtain  a  literal  solution  to  the  problem  but  trivially,  by 
the  time  s  selects  Ai,  we  already  have  the  literal 
solution  that  Ai  generates.  Now  according  to  the 
previous  definitions  there  is  an  equivalence  class  of 
problems  associated  with  each  of  the  A,’s.  Specifically, 
any  problem  x  is  in  the  equivalence  class  associated 
with  At  if  Ai  solves  x  (i.e.,  produces  a  literal  solution  for 
x),  but  no  Aj,i<i,  solves  x.  We  refer  to  this  equivalence 
class  by  [Ai  H]$  and  note  that  [ AiH],  =  [x  H],  for  any 
problem  x  that  t  solves  by  selecting  A,. 

As  before  %([Ai  Hn]t  H71)  is  the  relative  frequency  of 
members  of  [A,-  Hn],  in  Hn,  the  first  n  problems  of 
sample  H.  If  c (A  j  H")  is  the  average  cost  of  executing 
Ai  on  the  problems  of  Hn  then 

i*i 

This  is  the  average  cost  that  $  incurs  when  s  is  applied 
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to  H*  and  (elects  A*.  The  total  cost  C*(/f")  that  s  incurs 
on /Pis 

»  £  e  do  H,l  )%<Ia  H\ 

•^i-V  * 

Given  this  measure,  it  can  be  determined  if  each 
component  algorithm  isproviding  more  benefit  than 
the  cost  it  is  incurring.  That  is,  although  the  expected 
cost  of  solving  a  given  problem  with  any  A*,  k  <i,  is  less 
than  with  A  j,  the  cost  of  executing  each  such  A*  is 
incurred  in  solving  every  problem  from  the  class 
(A  j  Hn)t.  Classes  that  are  too  expensive  or  of  low 
relative  frequency  can  be  dropped  from  s  to  yield  a  new 
strategy  that  has  better  overall  performance.  Given 
measurements  of  the  relative  frequencies  and  costs  for 
the  problem  classes  a  dynamic  programming  method 
can  be  used  to  select  the  subset  of  component 
algorithms  with  lowest  expected  cost  for  the  current 
problem  sample  [Hartman,  1987],  Note  that  if  we  take 
a  hybrid  algorithm  that  is  optimized  in  this  way  for  a 
particular  problem  sample  andapply  it  to  a  sample 
with  different  statistical  properties,  not  surprisingly, 
we  are  likely  to  obtain  different  performance. 

In  addition  to  obtaining  a  globally  optimal  set  of 
component  algorithms  for  a  particular  problem  sample, 
we  can  also  determine  under  what  conditions  an 
incremental  change  to  s,  e.g.,  the  insertion  of  a  new 
component  algorithm  into  the  sequence,  yields  an 
improvement  on  the  expected  cost  per  problem.  That 
is,  we  know  where  candidates  for  future  improvements 
may  be  found. 

We  do  not  claim  that  this  is  the  best  way  to  solve  any 
particular  kind  of  problem.  The  optimization  is  only 
within  the  narrowly  defined  class  of  hybrid  algorithms. 
However,  the  example  does  show  how  the  cost  measure 
defined  in  the  previous  section  can  be  used  to  optimize 
performance.  In  particular,  the  performance 
improvement  is  independent  of  the  actual 
representations  and  component  algorithms  used.  It 
does  not  depend  on  increasing  knowledge  about  how  to 
solve  the  particular  problem  such  as,  in  this  case, 
motion  planning.  The  improvement  ia  based  solely  on 
the  invariant  statistical  properties  of  the  problem 
sample  and  measurement  of  performance. 

7  Conclusion 

Agents  in  real  world  domains  will  be  called  upon  to 
solve  difficult  problems.  Although  an  agent  might 
choose  to  expend  arbitrarily  large  amounts  of  a 
resource  in  solving  a  particular  problem,  the 
opportunity  coats  of  doing  so  typically  make  such  a 
choice  prohibitively  expensive.  This  resource  would  be 
better  spent  in  either  solving  approximations  of  this 
problem,  ignoring  this  problem  completely  and  solving 
other,  easier  problems,  or  improving  the  agent’s 
general  ability  to  solve  these  difficult  problems.  This  is 
as  true  for  agents  who  introspect  upon  their  own 
problem  solving  ability  as  it  is  for  designers  who 
attempt  to  ascertain  the  utility  of  AI  systems  that  they 
develop.  We  argue  that  these  performance  choices 
should  be  made  explicit,  and  that  they  should  be  based 
upon  the  resource  cost  incurred  in  solving  actual 
problem  samples. 


We  have  provided  one  example  of  a  resource  cost 
measure.  Although  by  no  means  definitive,  it 
measures  the  search  costs  associated  with  finding  the 
solution  to  a  given  sample  of  problems  using  a  given 
search  strategy.  As  opposed  to  previous  formal 
analyses  of  search  strategies  within  artificial 
intelligence,  the  emphasis  here  is  not  upon  the 
efficiency  of  the  solution,  as  it  is  in  A*,  but  upon  the 
amount  of  resources  expended  in  finding  solutions. 
Central  to  this  endeavor  is  the  concept  of  dividing  a 
problem  space  into  a  set  of  problem  classes.  We  can 
measure  both  the  frequency  with  which  a  search 
strategy  places  problems  from  the  sample  within  a 
class,  ana  the  average  cost  of  solving  each  problem 
placed  into  a  class.  This  allows  one  to  demonstrate 
improved  performance  by  changing  the  search  strategy 
to  place  more  of  the  problems  into  those  classes  for 
which  solutions  can  quickly  be  generated,  or  to 
ascertain  more  quickly  into  which  class  a  problem 
belongs.  Cost  measures  such  as  the  one  described  here, 
will  be  necessary,  we  believe,  in  order  to  maximize  the 
resource  use  of  an  agent  which  must  solve  many 
problems  drawn  from  an  intractable  class. 
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ABSTRACT 


Earlier  papara  (  Allan  and  Bayaa  1985,1986) 
described  a  compact  axiomatic  theory  which 
provided  a  formal  basis  for  tesporal  reasoning. 
This  theory  makes  a  sharp  distinction  between 
time  points  and  periods  of  time.  Me  show  here, 
by  considering  possible  models  of  the  theory, 
that  it  can  be  slightly  extended  to  give  a  better 
fit  to  intuition.  In  particular,  the  extended 
theory  la  preserved  under  changes  of  tenoral 
granularity. 

TIMEPERXOOS  AMD  MUTING 


The  original  period  theory  (Allen  1984,  Allen 
t  Bayes  1986  )  need  periods  rather  than  points  as 
its  temporal  primitive.  (  The  terminology  used 
earlier  was  'interval'  rather  than  'period'.  Me 
have  changed  to  avoid  confusion  with  the 
mathematical  use  of  ' interval' .  )  The  thirteen 

possible  relationships  between  periods,  including 
for  example  overlapping,  inclusion,  and  before 
are  defined  in  terms  of  MEETS,  which  wa  will 
write  ae  in  infix  colon,  with  p:q:r  meaning  p:q 
and  q:r.  The  basic  axioms  of  the  theory  are  then 
as  follows! 

Ml  fossil  p.g,r,s.  (  piq  and  pis  and  rig  ) 

Implies  sis 

M2  forall  p, q, r, s.  (  piq  and  ris  >  implies 

(  p:a 

xor  exists  t.  p:t:s 
xos  exists  t.  rittq  ) 

M3  forall  p.  exists  q,r.  q:p:r 

M4  forall  p,q,r,s.  (  p:qss  and  pins  ) 

implies  q-r 

MS  forall  p.q.  ptq  implies 

exists  r,s.  r:piq:s  and  r:(p+q):s 


These  five  axioms  are  all  the  assumptions  which 
the  theory  makes.  For  a  longer  discussion  of 
their  implications  and  justification,  see  (  Allen 

and  Bayes  1986  ) . 

A  key  Intuition  is  that  propositions  are  true 
or  false  during  periods,  rather  than  at  points  of 
time.  This  overcoaMS  the  problem  of  the  'divided 
instant'  (Van  Benthsm  1982) .  Consider  switching 
on  a  light,  so  that  the  proposition  "light  on"  is 
first  falsa,  then  immediately  true.  If  the 
timeperiods  are  thought  of  as  sets  of  points, 
then  only  artificial  constructions  can  avoid  the 
dilemma  of  there  being  a  point  at  which  the  light 
is  neither  on  nor  off  (  or,  worse,  both  on  and 
off  ) . 

However,  tlmepoints  can  be  defined  within  the 


theory  as  the  'places'  where  periods  meet.  In 
(Allan  6  Bayes  1986  ) ,  a  set-theoretic 
construction  is  given  of  points  from  periods,  but 
we  can  introduce  points  directly,  and  it  can  be 
shown  that  the  theory  obtained  is  identical.  Me 
will  use  variable  nasms  u,v,w, ..  for  points, 
p,q,r,..for  periods,  and  introduce  two  functions 
START  and  END  from  periods  to  points. 

PI  forall  p.q.  p:q  iff  end (p) -start (q) 

P2  forall  u  exists  p.  u-end(p) 

It  is  possible  now  to  show  that  points,  whatever 
they  are,  have  all  the  properties  one  would 
directly  expect,  such  as  being  a  totally  ordered 
infinite  set  with  no  limits. Me  could  also  start 
with  points  as  the  primitive  idea  and  define 
periods  as  pairs  of  points  from  the  totally 
ordered  set.  In  (  Allen  and  Bayes  1986  )  we  show 
how  the  natural  construction  ot  periods  from 
points  by  forming  pairs  <u,v>  is  a  proper  inverse 
to  the  derivation  of  points  from  periods  defined 
by  these  axioms,  so  that  such  intuitively 
compelling  results  as  pxatart  (p)  ,end(p)>  are 
provable  from  the  axioms.  These  points,  however, 
are  mere  mathematical  abstractions,  and  not  times 
at  which  an  event  occurs  or  when  soam  proposition 
is  true- 

POINTS  AMD  MOMENTS 

Points  in  time  are  places  where  periods  meet, 
but  they  arent  theamelves  periods,  not  even  very 
short  ones.  A  time period  is  the  sort  of  thing 
that  an  event  might  occupy:  it  has  substance, 
while  a  point  is  merely  an  abstraction.  Consider 
for  example  a  bail  tossed  into  the  air  on  the  one 
hand,  and  a  flash  of  lightning  on  the  other.  The 
time  of  the  balls  flight  can  be  divided  into  two 
periods,  one  of  rising,  the  other  of  falling. 
The  ball  spends  no  actual  time  at  the  top  of  its 
flight:  it  does  not  hover  there  for  a  period,  no 
matter  how  small.  The  periods  of  rising  and 
falling  MEET  each  other  directly.  The  point  at 
their  meeting  is  a  conceptual  abstraction,  not  a 
real  physical  tlaeperlod.  In  contrast,  the  time 
taken  by  a  flash  of  lighting,  although  pointlike 
in  many  ways,  must  be  a  period  because  it 
contains  a  real  physical  event.  Other  things  can 
happen  at  the  sasw  time  as  the  flash,  such  as  a 
photograph  being  exposed  or  a  horse  dying.  This 
is  how  we  characterise  such  very  brief  moments  of 


forall  p.  Moment (p)  iff  not  exists  q, r.  p-q+r 
le,  a  moment  is  indivisible  into  subperiods. 

Moswnts  have  many  of  the  properties  of  points. 
Tor  example,  if  a  period  has  moments  at  its  ends 
then  they  are  unique,  and  they  uniquely  define 
the  period  between  them,  by  MS.  But  they  also 
differ  in  many  ways.  For  example,  being  periods, 
they  have  distinct  endpoints. 
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Th«  theory  developed  so  far  la  somewhat 
unintuitive  on  thla  natter  of  truth  at  points  and 
the  qualities  of  moments.  On  one  hand,  sone 
points  seen  to  be  natural  receptaeles  of  truth, 
and  on  the  other,  it  is  sonetimes  natural  to 
treat  a  nonant  as  being  more  pointlike.  For 
example, the  tossed  balls  vertical  velocity  is 
positive  on  the  way  up,  negative  on  the  way  down, 
so  nust  be  sero  sonewhen,  by  the  internediate 
value  theoren:  and  the  point  where  these  periods 
neet  is  the  obvious  candidate.  Qualitative 
reasoning  uses  such  points  liberally,  and  it  is 
hard  to  see  how  it  could  do  without  then.  On  the 
other  hand,  it  seens  unnatural  to  distinguish  the 
start  fron  the  end  of  a  lightning  flash,  but  we 
nust  if  it  is  to  occupy  nore  than  a  point. 

We  adght  try  to  rescue  the  fornalisn  by 
'inserting'  a  nonant  whenever  a  proposition  needs 
to  be  true  at  a  point.  The  intuitive  argument 
here  would  be  that  since  the  ball  is  notionlasa, 
there  anist  be  a  period  -  perhaps  an  'infinitely 
short'  one,  whatever  that  means  -  for  it  to  be 
motionless  in.  But  this  leads  to  the  dual 
problem:  just  as  points  ace  too  snail  to  be 
occupied  by  events,  moments  (  as  we  have  defined 
then  )  are  too  long  to  be  suitably  infinitesimal . 
Since  a  nonant  is  a  period,  it  has  distinct 
endpoints,  even  though  no  points  inside  it.  We 
cant  identify  these  points  without  denying  the 
existence  of  the  moment  itself,  for  to  identify 
then  is  to  claim  that  periods  before  and  after 
the  moment  MEST,  by  FI.  Thus,  the  tian  the  ball 
cosms  to  rest  and  the  tian  it  starts  to  fall 
would  be  separated  by  an  interval  during  which 
the  ball  would  be  hovering,  an  unacceptable 
consequence.* 

We  seen  to  need  to  be  able  to  treat  amenta  as 

points,  and  vice  versa,  to  capture  intuition 
better. 

MAtrim  MOMENTS  WTO  POORS 

Suppose  we  introduce  a  napping  which 
identifies  the  endpoints  of  moments,  so  that 
moments  are  napped  into  points.  This  is  what 
Bobbs  (IMS)  calls  a  change  of  granularity.  The 
tolerance  relation  which  defines  it  in  this  case 
is  that  two  periods  are  indistinguishable  if 
their  endpoints  are  at  s»st  a  moment  apart.  We 
can  define  this  as  follows: 

forall  u, v.  u--v  iff  u— v 

or  Honant«u.v>) 
or  Moment (<v,u>) 

forall  p,q.  p— q  iff  start  (p) — start  (q) 
and  end (p)  — end (q) 

As  Hobbs  notes,  the  change  of  granularity  is 
well-understood  only  in  the  case  where  the 
tolerance  relation  is  transitive,  and  hence  ia  an 
equivalence  relation.  It  is  easy  to  see  that  — 
is  transitive  just  when  aments  never  neet: 

MC  forall  n,n.  Moment (ml  and  Moment (n)  implies 

not  n:n 

(  for  if  two  aments  awet,  then  the  start  of  the 
first  does  not  —  the  end  of  the  second  ) .  If  we 
add  this  to  our  axioms,  then  the  extended  theory 
is  preserved  under  the  blurring  which 
Introduces.  We  prove  this  by  defining  relations 
which  'imitate'  those  of  the  basic  theory  but 
with  a  aments  'blur'  at  their  ends. 


•  This  is  sometimes  what  naive  intuition 
predicts,  in  fact:  the  'read-runner  effect'. 


forall  p,q.  p: :q  iff  end(p> — start (qt 

or  equivalently 

forall  p, q.  p::q  iff  p:q 

or  axista  n.  Moment (n)  and 
p:m:q 

or  »*overlap(p,q) 

forall  p.q.n.  p:q  implies  p++q  -  p+q 
and 

(  Moment (n)  and  p:n:q  ) 

implies  p*-*q  -  p+m+q 


Mow,  let  Ml'  through  M6'  be  the  result  or 
rewriting  Ml  through  M<  with  :,  *  and  -  replaced 
by  : : ,  h  and  — — .  Then  we  have 


THEOREM.  M1-M6  entails  Ml'-M6' 


The  proof  is  elementary  but  tedious  and  omitted 
hare. 

It  is  easy  to  see  that  if  —  is  equality,  then 
the  relation  : :  reduces  to  the  primitive  :  and 
similarly  ♦♦  becomes  the  same  as  ♦  .  Hence,  the 
axioms  Ml '  -MS'  nay  be  regarded  as  the  theory  of 
—  -equivalence  classes. 

This  result  shows  that  the  theory  with  Ht 
applies  uniformly  at  any  level  of  coarsness  in 
which  omenta  are  identified  with  their 
endpoints.  We  can  reason  about  periods  and  nove 
flexibly  from  one  level  of  acuity  to  another, 
needing  only  a  truth-maintenance  systen  to  keep 
track  of  the  various  granularity  assumptions 
being  node  from  time  to  time . 


This  seams  to  provide  a  solution  to  the 
intuitive  problems  with  which  we  began.  A  moment 
can  be  treated  as  a  point,  or  vice  versa,  without 
changing  the  axioms  of  the  temporal  theory.  The 
extended  theory  is  stable  under  changes  of  scale. 
Notice  that  since  the  blurred  relation 
Includes  the  basic  meets  relation,  we  could  use 
both  relations  in  a  description,  and  the  theory 
would  still  be  preserved  under  : :  collapsing,  so 
that  we  do  not  have  to  treat  all  the  moments  as 
points  or  vice  versa.  Thus,  in  the  balltoss 
example,  the  topmost  point  can  be  talked  of  as 
though  it  were  a  moment,  but  all  other  points 
retain  their  Inferior  status  as  real  points  which 
do  not  expand  into  axxnents  and  in  which  nothing 
can  be  asserted.  Or  as  an  alternative,  we  could 
claim  that  all  points  have  this  expandable 
status,  and  then  be  forced  into  describing  the 
door's  status  at  the  moment  it  closes. 


Intuitive  arguments  for  the  reasonableness  of 
M<  include  the  idea  of  perceptual  acuity.  When 
two  dots  are  close  enough,  we  perceive  than  as 
one.  Imagine  three  dots  x,y,s  with  x  and  y  so 
close  that  they  would  seen  dot-like  if  sees 
alone,  and  similarly  y  and  a.  One  now  perceives 
a  short  line,  a  dash.  Two  adjacent  spatial 
'aments'  become  a  short,  dense,  'period'. 
Adjacent  'aments'  cannot  be  perceived. 
Similarly,  for  exaapla,  a  succession  of  flashes 
with  no  periods  between  them  are  seen  as  s 
continuous  light  filling  an  period,  which  is  why 
the  amvies  work. 

However,  it  is  net  yet  clear  what  the  axxiels  of 
the  theory  with  M6  are.  In  the  next  section  we 
will  show  that  they  are  similar  to  the  usual 
model  of  continuous  time  as  the  rational  or  real 
line. 
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MODELS  or  THE  THEORY 

The  axiom  M6  seems  rather  strange  at  fir  at 
tight,  ainca  in  tha  two  moat  obvious  catagotiaa 
of  nodal  of  tha  baaic  thaoty,  it  ia  aithar 
vacuoua  ot  falaa.  Thaaa.  ara  raapactivaly  tha 
dense  and  diacrata  models.  In  tha  lattar,  time  ia 
clock  time,  a  sequence  of  diacrata  tlcka.  Whila 
many  nonatandard  nodala  exist,  tha  simplest  ia  of 
couraa  tha  intagara.  In  thaaa  nodala,  avary 
moment  meets  tha  naxt  moment,  and  ao  M6  ia  aa 
falaa  aa  it  can  ba,  everywhere.  On  tha  othar 
hand,  in  a  danaa  nodal,  avary  pariod  can  ba  aplit 
into  conaacutiva  aubparioda,  ao  no  moments  axiat, 
and  M<  ia  trua  but  boring.  However,  thara  ia  a 
nontrivial  daaa  of  nodala  which  ara  naatly 
character iaad  by  M(. 

In  a  danaa  nodal,  tha  pointa  muat  fora  an 
unbounded  danaa  linear  order. If  tha  nodal  ia 
countabla  (  and  ona  alwaya  ia  )  than  thia  ia  tha 
rational  line,  but  avarything  hara  appliaa 
equally  well  to  tha  raala.  Periods  then  can  ba 
interpratad  aa  open  iptarvala,  with  (a,b):(b,c) 
defining  tha  swats  relation.  Tha  fact  that  tha 
intarvala,  conaidared  aa  aata  of  pointa,  dont 
contain  their  andpointa,  ia  not  relevant  to  the 
way  in  which  they  it and  for  parioda  in  thia 
nodal.  An  exactly  iaonorphic  nodal  ia  obtained 
by  interpreting  parioda  aa  cloaad  intarvala  with 
(a,b):(b,c]  ,  for  example. 

Motlca  howavar  what  happena  if  we  include  ALL 
rational  intarvala,  opan  and  cloaad,  and  try  to 
dafina  HEETing  in  tha  intuitively  atraightforward 
way  which  takaa  account  of  intarvala  aa  aata  of 
pointa.  Taka  for  exaaple  tha  Intarvala 

<a,b)  (a,bj  (b,c)  (b.c) 

eallad.  aay,  p.q.r  and  a  raapactivaly .  Than  wa 
would  want  it  to  ba  true  that  p:r  and  q:a,  but 
not  that  p:a.  It  followa  than  (  f root  m2  and  a 
little  work  )  that  there  nuat  ba  an  pariod  t  auch 
that  p:t:a.  Tha  only  candidata  for  auch  an 
pariod  ia  tha  ainglaton  (b),  ia  tha  cloaad 
interval  (b] .  How  thia  must  ba  a  moment,  ainca 
evidently  [b]  haa  no  aubparioda.  So  thia 
interpretation  ia  not  a  modal  of  tha  danaa 
theory.  On  tha  othar  hand,  it  certainly  ia  not 
iaonorphic  to  tha  intagara  of  tha  diacrata  modal. 
In  fact,  thia  ia  a  (rather  extrema)  example  of  a 
third  clasa  of  interpretationa,  in  which  momenta 
are  intuitively  point -like. 

Tha  general  form  of  a  model  of  thia  aort  - 
call  it  a  PACKED  modal  -  over  tha  rationale,  ia 
aa  followa.  Tha  doom  in  of  parioda  containa  opan 
rational  Intarvala  and  parhapa  aome  ainglaton 
cloaad  intarvala,  and  all  Intarvala  which  can  ba 
generated  by  taking  tha  intariora  of  tha  uniona 
of  the  cloaurea,  and  tha  aiapla  concatenation,  of 
other  intarvala.  Thua  if  it  containa  (a,b)  and 
(b.c)  and  (dl,  than  it  will  alao  contain  (a,c). 
(b, d)  and  (a,d),  for  example.  MEETS  la  defined 
aa  followa;  if  (b]  ia  not  in  tha  domain,  than 
(a,b):(b,c)/  but  otharwiaa,  meat*  ia  interpreted 
aettheoretlcally,  ao  that  (a,b) : (b) : (b,c)  ,and 
<a,b]:(b, cl,  ate..  At  ona  extrema,  a  packed 
nodal  ia  aimply  a  danaa  awdel  whan  no  cloaad 
ainglaton  inatanta  ara  praaant.  At  tha  othar,  it 
ia  tha  fully  aat -theoretical  interpretation  in 
which  parioda  ara  rational  Intarvala .  But  many 
othara  ara  poaaible,  for  example  ona  in  which  tha 
only  ainglaton  momenta  are  tha  intagara,  or  where 
parta  of  tha  line  ara  danaa,  parta  danaa  with 
momenta;  or  where  juat  a  few  laolatad  pointa  ara 
dlatingulahed  aa  being  tha  intariora  of  aeoanta. 


Thaaa  ainglaton  Intarvala,  being  parioda,  have 
diatinct  andpointa.  Thua  in  a  crowded  model,  tha 
axioma  FI,  F2  aaaart  the  axiatance  of  'points'  on 
either  aide  of  a  rational  point:  atartl (a))  and 
end([a)).  Tha  granularity-coarsening  mapping 
defined  ealrlier  can  ba  seen  aa  making  tha  claim 
that  thia  distinction  ia  irrelevant . 

It  followa  from  tha  earlier  result  that  thara 
la  no  way  of  formalising  tha  distinction  between 
thaaa  point-intervals  and  more  substantial 
moments  within  tha  vocabulary  of  tha  time  theory 
f o  far  developed.  Ona  can  sea  this  intuitively 
by  observing  that  any  modal  which  containa  auch 
instants  can  ba  replaced  by  a  modal  in  which 
inatanta  ara  real  intarvala,  amply  by 
'stretching'  tha  line  apart  at  those  positions 
and  inserting  a  gap.  Exactly  tha  same  aantancaa 
in  tha  language  would  ba  made  trua  by  such  a 
stretched  modal,  ainca  thaaa  enlarged  gaps  would 
contain  no  meeting-points  referred  to  by  tha 
aantancaa  of  tha  theory,  ao  would  ba  momenta  juat 
aa  before;  and  tha  way  they  interact  with  tha 
othar  periods  will  ba  unchanged.  In  order  to 
axpraas  tha  distinction,  wa  would  need  to 
introduce  soma  way  of  talking  about  tha  aixa  or 
duration  of  parioda,  and  than  point-lnt  -  /ala  ara 
moments  with  no  duration. 

Tha  overall  picture  ia  thia.  Tha  axiom  M6 
forces  ua  to  interpret  momenta  as  things  with  no 
extent,  merely  pointlike  objects,  and  alao 
exactly  permits  us  to  systematically  treat  them 
aa  points  within  tha  same  theory,  by  a  simple 
change  of  granularity.  Thus  thara  can  ba  a 
moment  during  which  tha  toaaad  balls  velocity  is 
zero,  and  wa  can  conaiatantly  treat  it  as  though 
it  ware  a  point.  And  whan  wa  examine  tha  models 
of  our  aasuaptiona,  thara  turns  out  to  ba  a 
single  point,  in  fact,  where  tha  moment  should 
ba.  And  yet  othar  parioda  can  maet  directly, 
whan  time  ia  danaa,  without  there  being  momenta 
between  them.  So  wa  can  have  our  sonants  and  aat 
them. 
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A  Formal  Theory  of 
Plan  Recognition 

Henry  Alexander  Kautz 
Abstract 

Research  in  discourse  analysis,  story  understanding,  and  user  modeling  for 
expert  systems  has  shown  great  interest  in  plan  recognition  problems.  In  a  plan 
recognition  problem,  one  is  given  a  fragmented  description  of  actions  performed  by  one 
or  more  agents,  and  expected  to  infer  the  overall  plan  or  scenario  which  explains  those 
actions.  This  thesis  develops  the  first  formal  description  of  the  plan  recognition 
process. 

Beginning  with  a  reified  logic  of  events,  the  thesis  presents  a  scheme  for 
hierarchically  structuring  a  library  of  event  types.  A  semantic  basis  for  non-deductive 
inference,  called  "minimum  covering  entailment",  justifies  the  conclusions  that  one  may 
draw  from  a  set  of  observed  actions.  Minimum  covering  entailment  is  defined  by 
delineating  the  class  of  models  in  which  the  library  is  complete  and  the  set  of  unrelated 
observations  is  minimized.  An  equivalent  proof  theory  forms  a  preliminary  basis  for 
mechanizing  the  theory.  Equivalence  theorems  between  the  proof  and  model  theories 
are  presented.  Minimum  covering  entailment  is  related  to  a  formalism  for  non¬ 
monotonic  inference  known  as  "circumscription".  Finally,  the  thesis  describes  a 
number  of  algorithms  which  correctly  implement  the  theory,  together  with  a  discussion 
of  their  complexity. 

The  theory  is  applied  to  a  number  of  examples  of  plan  recognition,  in  domains 
ranging  from  an  operating  system  advisor  to  the  theory  of  speech  acts.  The  thesis 
shows  how  problems  of  medical  diagnosis,  a  similar  kind  of  non-deductive  reasoning, 
can  be  cast  in  the  framework,  and  an  example  previously  solved  by  a  medical  expert 
system  is  worked  out  in  detail. 

The  analyses  provides  a  firm  theoretical  foundation  for  much  of  what  is  loosely 
called  "frame  based  inference",  and  directly  accounts  for  problems  of  ambiguity, 
abstraction,  and  complex  temporal  interactions,  which  were  ignored  by  previous  work. 
The  framework  can  be  extended  to  handle  difficult  phenomena  such  as  errors,  and  can 
also  be  restricted  in  order  to  improve  its  computational  properties  in  specialized 
domains. 
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Pofemes  humoristiques  sur  l’AI 


If  you're  dull  as  a  napkin,  don't  sigh; 

Make  your  name  as  a  "deep"  sort  of  guy. 
You  just  have  to  crib,  see 
Any  old  book  by  Kripke 
And  publish  in  AAAI. 

A  hacker  who  studied  ontology 
Was  famed  for  his  sense  of  frivolity. 

When  his  program  inferred 
That  Clyde  ISA  bird 

He  blamed  -  not  his  code  -  but  zoology. 

If  your  thesis  is  utterly  vacuous 
Use  first-order  predicate  calculus. 

With  sufficient  formality 
The  sheerist  banality 

Will  be  hailed  by  the  critics:  "Miraculous!" 

If  your  thesis  is  quite  indefensible 
Reach  for  semantics  intensional. 

Your  committee  will  stammer 
Over  Montague  grammar 
Not  admitting  it's  incomprehensible. 
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Chapter  1 
Introduction 


1.1.  Motivation 

Perhaps  the  central  concern  of  Artificial  Intelligence  is  to  devise  methods  for 
representing  and  reasoning  about  actions  and  plans.  While  plan  synthesis  has  received 
careful  formal  analyses  [McCarthy  &  Hayes  69],  the  inverse  problem  of  plan  recogni¬ 
tion  (or  action  interpretation)  has  appeared  in  mainly  empirical  and  domain-specific 
programs  of  research.  These  include  work  on  story  understanding  [Bruce  81, 
Wilensky  83],  psychological  modelling  [Schmidt  78],  natural  language  pragmatics 
[Allen  83,  Litman  84],  intelligent  computer  system  interfaces  [Huff  &  Lesser  82],  and 
strategic  planning.  In  each  case,  one  is  given  a  fragmented,  impoverished  description 
of  the  actions  performed  by  one  or  more  agents,  and  expected  to  infer  a  rich,  highly  in¬ 
terrelated  description.  The  new  description  fills  out  details  of  the  setting,  and  relates  the 
actions  of  the  agents  in  the  scenario  to  their  goals  and  future  actions.  The  result  of  the 
plan  recognition  process  can  be  used  to  generate  summaries  of  the  situation,  to  help  (or 
hinder)  the  agent(s),  and  to  build  up  a  context  for  use  in  disambiguating  further  obser¬ 
vations.  This  thesis  develops  a  formal  analysis  of  plan  recognition.  The  analysis  pro¬ 
vides  a  firm  foundation  for  much  of  what  is  loosely  called  "frame  based  inference" 
[Minsky  75],  and  directly  accounts  for  problems  of  ambiguity,  abstraction,  and  com¬ 
plex  temporal  interactions,  which  were  ignored  by  previous  approaches. 

Plan  recognition  problems  can  be  classified  as  cases  of  intended  or  keyhole 
recognition  [Cohen,  Perrault,  &  Allen  81].  In  the  first  case,  but  not  the  second,  the 
observer  can  assume  that  the  agent  is  deliberately  structuring  his  activities  in  order  to 
make  his  intentions  clear.  Recognition  problems  can  also  be  classified  as  to  whether 
the  observer  has  complete  knowledge  of  the  domain,  and  whether  the  agent  may  try  to 
perform  erroneous  plans  [Pollack  86].  This  thesis  concentrates  on  keyhole  recognition 
of  correct  plans,  where  the  observer  has  complete  knowledge.  We  will  consider,  how¬ 
ever,  some  examples  from  discourse,  which  are  cases  of  intended  recognition. 

An  important  preliminary  step  is  to  define  the  scope  of  inferences  which  must 
be  treated  by  a  theory  of  plan  recognition.  Plan  synthesis  can  often  be  viewed  as  purely 
hypothetical  reasoning  (i.e.,  if  I  did  A,  then  P  would  be  true).  Some  attempts  have 
been  made  [Chamiak  85]  to  formalize  plan  recognition  as  a  similar  kind  of  hypothetical 
reasoning:  infer  a  plan  P,  such  that  if  the  agent  did  P,  then  he  would  do  the  observed 
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action  A.  Only  a  space  of  possible  inferences  is  outlined,  and  little  or  nothing  is  said 
about  why  one  should  infer  one  conclusion  over  another,  or  what  one  should  conclude 
if  the  situation  is  truly  ambiguous.  (Such  criticism  also  applies  to  work  based  on 
"plausible"  inference  [Allen  83,  Cohen  84,  Pollack  86].)  We  insist  that  a  theory  of  plan 
recognition  specify  what  conclusions  are  absolutely  justified  on  the  basis  of  the  obser¬ 
vations,  our  knowledge  of  actions,  and  other  explicit  assumptions.  In  fact,  our  frame¬ 
work  allows  one  to  draw  conclusions  based  on  the  class  of  simplest  plans  which  con¬ 
tain  the  observed  actions.  Finally,  while  planning  can  be  formalized  as  pure  deduction, 
to  formalize  plan  recognition  we  must  develop  a  non-deductive,  non-monotonic1  sys¬ 
tem  of  inference. 

An  advantage  of  this  approach  is  that  the  model  and  proof  theories  apply  to  al¬ 
most  any  situation.  They  handle  disjunctive  information,  multiple  concurrent  (and  un¬ 
related)  plans,  steps  "shared”  between  plans,  abstract  event  descriptions,  and  both  in¬ 
cremental  and  non-incremental  recognition.  The  corresponding  algorithmic  theory  may 
place  restrictions  on  the  form  of  information  handled  in  order  to  gain  processing  effi¬ 
ciency.  The  search  for  more  general  and  efficient  algorithms  can  continue  without  the 
need  to  revise  or  reinvent  the  basic  principles  upon  which  the  theory  of  recognition  is 
founded. 

The  vocabulary  that  has  been  used  to  describe  plan  recognition  varies  consider¬ 
ably.  We  will  speak  uniformly  of  observations  as  descriptions  of  events.  The  observ¬ 
er’s  knowledge  is  represented  by  a  set  of  first-order  statements  called  an  event  hierar¬ 
chy.  The  result  of  the  recognition  process  is  a  description  of  the  end  events,  those 
which  are  self-contained  and  seif-justifying,  which  make  up  the  situation.  The  wide 
applicability  of  the  event  vocabulary  suggests  that  this  work  is  relevant  to  areas  of 
Artificial  Intelligence  not  normally  associated  with  planning  or  plan  recognition,  such  as 
diagnostic  reasoning.  We  will  examine  a  simple  medical-diagnosis  problem  using  our 
logical  machinery. 

1.2.  Overview  of  Thesis 

An  event  hierarchy  is  a  collection  of  restricted-form  first-order  axioms,  used  to 
define  the  abstraction,  specialization,  and  functional  relationships  between  various 
kinds  of  events.  The  functional,  or  "role/value",  relationships  include  the  relation  of  an 
event  to  its  component  subevents.  There  is  a  distinguished  type-predicate,  End,  which 

1 A  system  is  non-monotonic  if  some  conclusions  which  may  be  drawn  from  a  given  set  of 
assumptions  may  no  longer  be  drawn  from  a  larger  set  of  assumptions. 
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holds  of  events  which  are  not  components  of  any  other  events.  Recognition  is  the 
problem  of  classifying  the  End  events  which  generate  a  set  of  observed  events.  The 
second  half  of  this  chapter  reviews  research  in  plan  recognition  in  many  different  areas 
of  Artificial  Intelligence,  as  well  as  related  work  in  non-monotonic  inference. 

Chapter  2  formally  defines  event  hierarchies,  and  shows  how  they  may  be  used 
to  represent  hierarchically-structured  plans  of  action.  It  argues  for  the  particular  method 
of  representation,  and  describes  the  expressive  limitations  of  alternative  systems.  Event 
hierarchies  can  encode  conditional  actions  without  additional  machinery.  A  hierarchy 
of  actions  involved  in  cooking  is  developed  in  detail,  and  serves  as  a  basis  for  examples 
throughout  the  rest  of  the  thesis. 

An  event  hierarchy  does  not,  however,  by  itself  justify  inferences  from  obser¬ 
vations  to  End  events.  The  lexical  hierarchy  justifies  deductions  from  an  event  to  its 
components;  it  does  not  rule  out  the  possibility  that  those  components  may  occur  with¬ 
out  reference  to  any  of  the  End  events  mentioned  in  the  hierarchy.  Consider  the  fol- 
lovying  example.  (The  thick  grey  arrows  denote  abstraction  or  "isa",  and  the  thin  black 
arrows  denote  component  or  "has  part".) 


End 


figure  1.1:  Hunt! Rob  Hierarchy 

Suppose  GetGun(C)  is  observed.  This  statement,  together  with  the  hierarchy,  H,  does 
not  entail  3x  .  Hunt(x),  or  3x  .  Hunt(x)  v  RobBank(x),  or  even  3x  .  End(x). 
There  are  models  of  (GetGun(C)}  vj  H  in  which  none  of  these  statements  hold.  For 
instance  (where  we  describe  a  model  by  listing  its  positive  atoms),  none  hold  in 
(GetGun(C)},  and  only  the  last  in  (GetGun(C),  CashCheck(D),  GoToBank(sl(D)), 
End(D)}. 

Yet  it  does  seem. reasonable  to  conclude  that  someone  is  either  hunting  or  rob¬ 
bing  a  bank,  on  the  basis  of  the  given  hierarchy.  This  conclusion  is  justified  by  as- 
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sinning  that  the  event  hierarchy  is  complete:  that  is,  whenever  a  non-End  event  occurs, 
it  must  be  part  of  some  other  event,  and  the  relationship  from  event  to  component 
appears  in  the  hierarchy.  This  completeness  assumption  can  be  expressed  by  defining  a 
special  subclass  of  models  of  H,  called  "covering  models".  For  the  example, 
{GetGun(C),  Os  1(D),  Hunt(D),  GoToWoods(s2(D)),  End(D)}  and  (GetGun(C), 
Os  1(D),  RobBank(D),  GoToBank(s2(D)),  End(D)}  are  covering  models  of  H,  but 
none  of  the  other  models  described  above  are. 

Chapter  3  uses  die  notion  of  a  covering  model  is  used  to  define  a  new  semantic 
relation,  called  c-entailment.  In  this  example,  hunting  or  bank  robbing  occurs  in  all 
covering  models  in  which  an  agent  gets  a  gun,  or  formally: 

GetGun(C)  f®c  Ex  .  Hunt(x)  v  RobBank(x) 

The  second  part  of  that  chapter  relates  c-entailment  to  ordinary  entailment  and  deduc¬ 
tion.  An  easily-computed  "closure"  function  cl  is  defined,  with  the  property  that  a 
statement  is  c-entailed  by  an  observation  if  and  only  if  that  statement  deductively  fol¬ 
lows  from  the  observation  and  the  closure  of  the  event  hierarchy.  That  is, 

r  h  K:  Q  ^  and  only  if  Tu  cl(H)  (~  Q 

Thus  cl(H)  axiomatically  captures  the  class  of  covering  models;  or,  equivalently,  ex- 
plicidy  states  the  closure  assumptions  in  effect  when  one  uses  an  event  hierarchy  for 
recognition.  The  theory  of  c-entailment  is  related  to  John  McCarthy's  system  cf  non¬ 
monotonic  inference  known  as  circumscription. 

When  several  events  are  observed,  still  stronger  assumptions  are  commonly 
employed.  Suppose  that  (GetGun(C),  GoToBank(D)}  is  observed.  This  set  does  not 
c -entail  an  instance  of  robbery;  the  model  containing  an  instance  of  hunting  and  an  in¬ 
stance  of  check  cashing  provides  a  counterexample.  By  Occam's  razor  (do  not  multiply 
entities  unnecessarily)  we  would  be  justified  in  concluding  3x  .  RobBank(x);  this  prin¬ 
ciple  can  be  realized  by  distinguishing  the  minimum  covering  models  of  the  observa¬ 
tions.  These  models  define  a  final  semantic  relation  between  observations  and  conclu¬ 
sions,  mc-entailment.  In  the  example, 

(GetGun(C),  GoToBank(D)}  ^  hmc  3x  .  RobBank(x)  a  C=sl(x)  a  D=s2(x) 
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Chapter  4  develops  the  theory  of  minimum  covering  models  and  mc-entailment.  Mc- 
entailment  is  shown  to  correspond  to  a  particular  case  of  circumscription  with  variables. 

Chapter  5  extends  the  theory  to  describe  incremental  plan  recognition.  An  in¬ 
cremental  recognition  process  cyclically  makes  observations  and  infers  consequences. 
The  conclusions  reached  at  the  end  of  any  particular  cycle  form  the  basis  for  those 
reached  in  the  next  cycle.  Incremental  recognition  is  dichronic:  the  conclusions 
reached  may  depend  upon  the  order  of  the  observations.  While  this  may  make  it  non- 
optimal  for  some  recognition  problems,  we  shall  argue  that  it  is  a  plausible  description, 
on  both  computational  and  psychological  grounds,  of  the  kind  of  inference  performed 
in  plan  recognition  and  its  applications. 

This  formal  framework  shows  how  one  can  infer  "up"  an  event  hierarchy,  and 
unify  the  explanations  of  several  observations  in  order  to  reach  a  stronger  conclusion. 
Very  few  restrictions  are  placed  on  the  kinds  of  events  which  can  be  encoded:  disjunc¬ 
tions  may  appear  in  plans  or  in  observations,  observations  may  be  incomplete,  and  ar¬ 
bitrary  temporal  constraints  may  appear  between  events.  Several  examples  of  plan 
recognition  are  discussed  in  detail  in  Chapter  6,  from  the  domains  of  cooking,  natural 
language  pragmatics,  and  operating  system  interfaces.  The  chapter  concludes  with  a 
comparison  of  plan  recognition  to  medical  diagnosis.  Diseases  can  be  taken  to  be  End 
events,  and  symptoms  to  be  component  events.  A  problem  handled  by  a  version  of 
INTERNIST,  a  medical  expert  system,  is  recast  in  our  framework. 

Traditional  work  in  artificial  intelligence  on  high  level2  recognition  problems 
has  often  relied  on  graph-matching  algorithms.  A  lexical  hierarchy  is  very  naturally 
represented  as  a  labeled  digraph,  and  such  graphs  can  be  given  a  straightforward  se¬ 
mantic  interpretation.  Chapter  7  describes  some  graph-based  recognition  algorithms 
which  are  justified  by  the  notion  of  mc-entailment:  that  is,  they  compute  structures 
which  can  be  interpreted  as  statements  true  in  all  minimum  covering  models.  The  algo¬ 
rithms  are  useful  and  interesting  in  that  they  suggest  what  conclusions  should  be 
drawn,  while  the  formal  framework  only  specifies  what  conclusions  can  be  drawn. 
There  are  three  basic  algorithms.  The  first  constructs  an  AND/OR  graph  bottom  up, 
from  each  observation  to  an  instance  of  End.  The  second  minimizes  the  set  of  End 
events,  by  performing  a  top-down  match  of  the  structures  created  in  the  previous  step. 
The  third  algorithm  controls  the  first  two,  and  decides  which  observations  are  part  of 


2High  level  recognition  problems  involve  the  application  of  a  great  deal  of  specific  world  knowledge  to 
a  relatively  small  amount  of  symbolic  (non-numeric)  data.  Low  level  recognition  problems  reverse 
this  situation:  there  is  a  massive  amount  of  quantitative  data,  interpreted  by  general  principles  about 
the  physical  nature  of  the  domain. 
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the  same  of  End  event  Has  the  framework’s  power  and  generality  been  bought  at  the 
cost  of  computational  intractability?  We  believe  not  While  it  is  easy  to  show  that  the 
worst-case  cost  of  solving  a  plan  recognition  problem  matches  that  of  general  deduction 
(namely,  exponential  on  the  size  of  the  knowledge  base),  careful  use  of  event  abstrac¬ 
tion  significantly  collapses  the  search  space. 

1.3.  Related  Work  on  Plan  Recognition 

Work  in  A.L  on  plan  recognition  has  concentrated  on  story  understanding,  dis¬ 
course,  and  intelligent  computer  interfaces  and  environments.  Each  area  is  reviewed  in 
roughly  chronological  order. 

1.3.1.  Story  Understanding 

1.3. 1.1.  Psychological  Modeling 

One  of  the  first  papers  to  explicitly  invoke  the  phrase  "plan  recognition"  was  the 
report  by  [Schmidt,  Sridharan,  &  Goodson  78]  on  the  BELIEVER  system. 
BELIEVER  was  designed  to  illustrate  and  test  a  psychological  theory  of  "how  descrip¬ 
tions  of  observed  actions  are  utilized  to  attribute  intentions,  beliefs,  and  goals  to  the 
actor."  Schmidt  and  his  colleagues  conducted  experiments  in  which  human  subjects 
were  presented  simple  descriptions  of  sequences  of  actions  by  a  single  agent  The  se¬ 
quences  were  interrupted  at  various  points,  and  the  subjects  were  asked  to  summarize 
the  events  so  far,  describe  what  the  agent  was  trying  to  do,  or  predict  what  the  agent 
would  do  next  The  researchers  observed  that: 

1.  Summaries  often  included  non-de scribed,  but  expected,  actions. 

2.  Subjects  did  not  provide  summaries  which  referred  to  a  disjunctive 

set  of  plans,  but  they  did  provide  "sketchy”  summaries. 

3.  Subjects  often  provided  summaries  of  the  form:  "The  agent  was 

trying  to  do  (some  act),  but  failed  because  (some  reason)." 

From  these  and  similar  observations,  Schmidt  concluded  that  people  understood 
and  remembered  event  sequences  by  recovering  the  implicit  structure  of  causal  relations 
between  the  events.  Schmidt  argued  that  plan  recognition  is  a  single-minded,  hypothe¬ 
sis-driven  process.  Based  on  the  initial  observations  (descriptions),  the  subjects 
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seemed  to  devise  a  single  hypothesized  plan  (for  the  actor).  This  hypothetical  plan 
would  be  incrementally  revised  and  made  more  detailed  as  further  observations  were 
made. 


BELIEVER  implemented  this  strategy  of  plan  recognition.  Once  the  setting  of 
the  story  was  input,  BELIEVER  would  retrieve  a  single  parameterized  plan  from  mem¬ 
ory.  As  each  observation  was  input,  BELIEVER  tried  to  match  the  description  against 
an  expected  action  (that  is,  an  action  in  the  plan  whose  preconditions  were  true). 
Failure  of  the  observation  to  match  an  expected  action  would  trigger  "critics",  that  is, 
specialized  pieces  of  code  which  revised  the  plan  to  account  for  errors  and  accidents  by 
the  agent 

The  work  on  BELIEVER  is  quite  different  from  our  own.  While  BELIEVER 
was  a  "top  down”  inference  system,  we  have  concentrated  on  the  "bottom  up"  aspects 
of  plan  recognition:  if  many  different  ultimate  plans  are  possible,  how  do  we  home  in 
on  just  a  few  possibilities?  BELIEVER  was  offered  as  a  rough  psychological  model, 
while  we  have  concentrated  on  building  a  tight  mathematical  model  of  "ideal”  perfor¬ 
mance.  Schmidt's  work  began  to  deal  with  the  critical  issues  of  errors  and  accidents, 
which  we  have  not  examined.  (Recent  work  by  [Pollack  84]  investigates  problems  in 
recognizing  erroneous  plans.)  Both  Schmidt  and  our  work  stress  the  use  of  a  library 
of  plans,  and  the  importance  of  "sketchy"  or  abstract  plans;  our  work  extends 
BELIEVER'S  notion  of  abstraction,  which  involved  plans  with  uninstantiated  parame¬ 
ters,  but  not  a  hierarchy  of  abstract  plan  types. 

After  the  BELIEVER  project  interest  grew  in  devising  systems  that  could  un¬ 
derstand  stories  which  involved  a  number  of  different  characters,  each  pursing  goals 
and  plans  which  could  interact  and  conflict  with  the  plans  of  the  other  characters. 
[Bruce  81]  presented  a  detailed  analysis  of  the  interacting  plans  in  a  number  of  chil¬ 
dren's  stories,  including  Hansel  and  Gretel.  Unfortunately  Bruce  did  not  continue  this 
line  of  research  with  a  computational  model.  The  greatest  nexus  of  interest  in  story  un¬ 
derstanding  grew  up  in  Roger  Schanks'  group  at  Yale. 

1.3. 1.2.  Script  Based  Systems 

Schank's  theory  of  scripts  was  designed  to  account  for  all  kinds  of  regularities 
in  the  world,  both  physical  and  social  [Schank  75].  For  example,  the  restaurant  script 
tells  us  that  restaurants  typically  have  tables  and  chairs,  and  that  a  person  in  a  restaurant 
typically  has  the  goal  of  eating. 
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While  scripts  alone  only  provide  a  limited  version  of  planning  and  plan 
recognition,  [Wilensky  82]  extended  script  theory  to  handle  more  dynamic  domains. 
Wilensky  argued  that  most  everyday  planning  problems  involve  little  or  no  search 
("canned"  plans  are  known  for  all  possible  goals)  but  do  involve  complicated  interac¬ 
tions  between  goals.  Furthermore,  plan  recognition  is  inextricably  intertwined  with 
plan  synthesis.  For  example,  given  the  following  text* 

John  wanted  the  newspaper.  It  was  raining  outside,  so 

John  called  for  his  dog  Spot. 

Wilensky  claimed  one  would  conclude  that  John  wants  Spot  to  fetch  the  newspaper  by 
reasoning  as  follows:  Having  the  newspaper  is  one  of  John's  goals.  The  standard  plan 
for  this  is  to  go  outside  and  get  the  newspaper,  and  so  this  standard  plan  is  (tentatively) 
included  in  (the  reader's  beliefs  about)  John's  wants.  The  fact  that  it  is  raining  would 
invoke  the  stay  dry  goal,  so  this  is  added  to  John's  wants.  The  reader  then  simulates 
John's  planning:  the  goals  conflict,  so  a  resolve  conflict  goal  is  also  created.  There 
are  many  different  ways  that  resolve  conflict  can  be  achieved,  so  the  recognizer  stops 
planning.  Now  the  reader  learns  that  John  called  for  Spot  The  effect  of  this  is  deter¬ 
mined  to  be  that  Spot  is  with  John.  The  reader  tries  to  connect  this  new  piece  of  input, 
as  tightly  as  possible,  with  the  cunent  plan.  It  notices  that  one  expansion  of  the  replan 
meta-plan  for  the  resolve  conflict  meta-goal  is  to  alter  the  get  paper  plan  so  that  some 
other  agent  goes  outside  and  gets  the  paper.  Making  Spot  the  other  agent  connects  this 
plan  to  the  input 

While  parts  of  this  theory  were  implemented  in  many  computer  programs,  much 
remained  vague.  A  crucial  feature  were  the  so-called  text  comprehension  principles, 
which  were  used  to  "tie  together"  the  individual  sentences  of  the  story.  These  were  co¬ 
herence  ,  meaning  the  character's  plan  is  consistent;  least  commitment ,  meaning  that  a 
reader  shouldn't  prematurely  assume  that  any  particular  explanation  found  is  the 
explanation,  and  then  have  to  undo  it;  and  parsimony  ,  meaning  that  a  reader  should 
"maximize  the  connections  between  the  inputs".  Specific  versions  of  these  general 
principles  appear  in  our  framework  for  plan  recognition.  Our  model  theoretic  approach 
forces  us  to  only  consider  consistent  plans  (inconsistent  plans  would  have  no  model). 
Least  commitment  arises  from  concluding  what  holds  in  all  minimum  covering  models, 
rather  than  a  particular  one.  The  principle  of  parsimony  corresponds  to  our  minimiza¬ 
tion  of  unrelated  events. 

1.3. 1.3.  Abduction 
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Recent  work  by  [Chamiak  85]  views  plan  recognition,  or  "motivation  analy¬ 
sis",  as  a  kind  of  abductive  inference.  The  term  "abduction"  comes  from  an  early  at¬ 
tempt  in  the  philosophy  of  science  to  look  at  explanation  as  the  process  of  finding  the 
best  hypothesis  which  logically  entails  the  thing  to  be  explained  [Peirce  58]. 3  For  ex¬ 
ample,  suppose  we  known  that  all  men  are  mortal: 

Vx  .  man(x)  d  mortal(x) 

The  thing  to  be  explained  is  that  Socrates  is  mortal.  An  abductive  inference  would  take 
us  from  mortal(Socrates)  to  man(Socrates).  Thus  Socrates  is  mortal  because  he's  a 
man.  Abduction  is  obviously  a  very  unsound  rule  of  inference,  that  can  lead  to  ridicu¬ 
lous  conclusions.  Pierce  never  claimed  abduction  was  all  that  was  involved  in  scientific 
explanation:  it  was  merely  a  method  of  generating  hypotheses  that  could  be  tested  and 
weighed  by  other  means. 

Chamiak's  system  employs  abductive  techniques  to  generate  hypotheses  which 
entail  the  (logical  forms  of)  the  sentences  in  simple  stories.  In  order  to  limit  the  amount 
of  inference,  only  a  small  subset  of  the  reader's  knowledge  is  assumed  to  be  available 
at  any  one  time.  For  example,  a  story  might  be: 

John  went  to  the  store.  He  walked  down  the  aisle  and 
picked  up  the  milk. 

The  individual  words  in  the  story,  such  as  "store"  and  "milk",  activate  the  semantically- 
related  plan  of  going  grocery  shopping  in  the  system’s  memory,  as  well  as  related  ax¬ 
ioms  about  how  shopping  carts  and  check-out  lines  work.  The  system  then  tries  to 
generate  a  resolution-style  proof  that  the  story  occurs,  where  it  is  allowed  to  make  two 
kinds  of  assumptions: 

1.  An  instance  of  any  active  concept  may  be  assumed  to  exist  Thus  the 
theorem  prover  may  simply  assume  3x  .  Shopping-Trip(x),  if 
such  an  assumption  is  needed  to  close  off  a  branch  of  the  proof 
tree. 


2.  Any  two  terms  may  be  assumed  to  be  equal,  if  the  system  cannot 
prove  (within  some  finite  amount  of  time)  that  the  terms  are  not 
equal.  Thus  the  system  may  assume,  for  instance,  that  the  go- 
to-store  step  of  the  Shopping-Trip  event  is  equal  to  the  particular 
instance  of  going  to  the  store  that  is  mentioned  in  the  story. 


Eventually  Peirce  abandoned  the  notion  of  abduction  [Hacking  83]. 
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Chamiak's  framework  is  much  less  "cautious"  than  our  own.  If  several  differ¬ 
ent  plans  could  entail  the  observations,  it  must  choose  a  single  one,  whereas  our 
framework  would  simply  conclude  the  disjunction.  It  is  also  not  entirely  clear  how 
well  the  technique  of  controlling  inference  by  drawing  activation  from  lexical  items  re¬ 
ally  works:  it  would  be  possible  to  "tune"  such  an  activation  network  so  that  It  activat¬ 
ed  just  the  formulas  needed  for  any  particular  example.  No  semantic  basis  for  the  sys¬ 
tem  is  given,  or  indeed  seems  possible.  Despite  these  criticisms,  Chamiak’s  work  is 
among  the  most  principled  in  the  "scruffy"  area  of  A.I.4 

1.3.2.  Discourse 

1. 3.2.1.  Allen  and  Perrault 

[Cohen  78]  first  formalized  Austin's  and  Searle’s  speech  act  theory  in  terms  of 
planning.  Utterances  were  viewed  as  actions  which  transformed  the  beliefs  of  the 
speaker  and  hearer.  [Allen  &  Perrault  80]  extended  the  analysis  to  include  plan  recog¬ 
nition.  Plan  recognition  is  necessary  to  account  for  the  fact  that  a  speaker  need  not  fully 
and  literally  execute  a  speech  act  in  order  to  achieve  its  effect  Instead,  the  speaker  need 
only  perform  an  act  which  suggests  to  the  hearer  that  the  speaker's  overall  intention  is 
to  achieve  the  desired  effect  Phenomena  accounted  for  by  plan  recognition  include  in¬ 
direct  speech  acts,  understanding  of  sentence  fragments,  and  certain  kinds  of  context- 
dependent  implications. 

Allen  analyzed  plan  recognition  in  terms  of  a  set  of  plan  recognition  rules  to¬ 
gether  with  a  heuristic  control  strategy.  The  rules  isolate  those  inferences  which  are 
plausible ,  but  not  valid.  The  control  strategy  determines  which  of  these  inferences 
should  actually  be  accepted.  An  example  of  these  rules  is  precondition/action  rule, 
which  states  that  if  (agent)  H  believes  (agent)  S  wants  proposition  P  to  hold,  then  H 
may  plausibly  conclude  that  S  wants  action  Act  to  be  performed,  given  that  P  is  a  pre¬ 
condition5  for  Act  Another  is  th zeffect! action  rule,  *hat  states  if  S  wants  P  to  hold, 
and  P  is  achieved  by  Act,  then  S  may  want  Act  to  occur. 


4  Workers  in  Artificial  Intelligence  are  often  divided  into  the  neat  and  scruffy  camps  [Schank  83],  with 
the  neats  trying  to  create  formal  theories  [Kautz  86c]  which  systematize  the  heuristics  uncovered  by  the 
intuition -driven  scruffies. 
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precondition! action 


H  Bel  S  Wants  P  ^  H  Bel  S  Wants  Act 

given  that  P  is  a  precondition  of  Act 


effectlacnon 

S  Wants  P  =>  S  Wants  Act 

given  that  P  is  an  effect  of  Act 

Rules  such  as  these  were  applied  to  form  a  chain  of  inference  from  a  single  observed 
action  (called  the  alternative )  to  one  of  a  number  of  possible  contextually-dependent 
goals  (called  the  expectations  ).  Allen’s  system  tried  to  find  the  "most  likely"  (or 
perhaps  "most  obvious")  chain  by  performing  a  best-first  search,  numerically  scoring 
chains  of  inference6  by  rules  like  the  following: 

(HI)  Decrease  the  rating  of  a  chain  of  inference  if  it  contains  an  action 
whose  preconditions  are  false  at  the  time  the  action  starts  exe¬ 
cuting. 

(H3)  Increase  the  rating  of  a  chain  of  inference  if  it  contains  descrip¬ 
tions  of  objects  and  relations  in  its  alternative  that  are  unifiable 
with  objects  and  relations  in  its  expectation. 

One  of  the  most  interesting  rating  heuristics  is  applicable  only  in  communicative  do¬ 
mains,  where  the  observer  can  assume  that  the  actor  is  trying  to  make  the  intentions  be- 
hinds  his  acts  obvious.  It  says  to  lower  the  rating  of  a  chain  of  inference  if  a  great 
many  different  rules  all  apply  to  its  last  step.  Thus  ambiguity  is  penalized. 

Allen  did  not  try  to  provide  any  theoretical  underpinning  for  his  rules  of  plausi¬ 
ble  inference,  in  terms  of  probability  or  model  theory.  Nor  did  his  system  deal  with 
multiple  observations,  which  is  the  main  focus  of  attention  in  our  work  (as  well  as  that 
of  [Litman  84],  discussed  in  the  next  section).  The  basic  notion  of  forming  a  chain 
connecting  an  observation  to  a  goal  seems  related  to  our  idea  of  a  covering  model,  in 
which  every  event  is  part  of  some  End  (goal)  event 

1.3.2. 2.  Extended  Discourse 

While  a  number  of  researchers  [Sidner  &  Israel  81,  Carbeny  83,  Grosz  & 
Sidner  87]  have  begun  to  extend  Allen's  work  to  deal  with  sequences  of  utterances, 

6 Allen  called  such  chains  of  inference  "partial  plans". 
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[Litman  84]  contains  the  most  detailed  and  concrete  proposals. 

Litman's  plan  recognition  system  includes  a  set  of  domain-specific  plan 
schemas,  a  set  of  domain-independent  meta-plan  schemas,  and  an  incremental  recogni¬ 
tion  algorithm.  Meta-plans  are  plans  which  can  take  other  plans  as  arguments.  They 
include,  for  example,  a  plan  to  help  a  hearer  identify  a  parameter  which  appears  in  an¬ 
other  plan  (IDENTIFY -PARAMETER) ,  and  a  plan  to  insert  a  repair  step  into  a  plan 
which  would  otherwise  fail  (CORRECT-PLAN).  Litman's  work  is  notable  in  provid¬ 
ing  a  deterministic,  highly  constrained  recognition  algorithm,  and  in  cleanly  accounting 
for  plan  suspension  and  resumption. 

The  plan  recognition  algorithm  constructs  a  stack  of  partially-recognized  plans. 
When  it  observes  an  action,  the  system  attempts  to  attach  it  somewhere  on  the  stack, 
according  to  the  following  preferences: 

1 .  Attach  (as  a  substep)  to  the  plan  on  the  top  of  the  stack; 

2.  Attach  to  a  new  meta-plan,  which  refers  to  a  plan  somewhere  in  the 

stack,  and  push  that  meta-plan  onto  the  stack; 

3.  Attach  to  a  new  meta-plan,  which  refers  to  some  other  new  plan.  If 

that  otter  plan  is  also  a  meta-plan,  construct  a  plan  for  it  to  refer 
to,  and  so  on,  until  a  domain-specific  plan  is  reached.  Push  ev¬ 
erything  onto  the  stack,  with  the  domain-specific  plan  on  the 
bottom  and  the  original  meta-plan  on  top. 

Attaching  an  observed  act  to  a  partial  plan  may  require  that  certain  equalities  be  assumed 
to  hold  between  parameters  of  the  act  and  the  plan.  Litman  calls  this  consistency  unifi¬ 
cation.  A  similar  result  is  obtained  in  our  framework  by  different  means.  As  will  be 
described  in  Chapter  4,  the  assumption  that  End  events  are  equal  can  lead  to  the 
conclusion  that  certain  parameters  of  substeps  of  the  event  are  equal 

Both  Litman's  algorithm  and  our  own  (as  will  be  seen  in  Chapter  7)  rely  on 
constraint  propagation  to  eliminate  alternative  interpretations.  Her  system  exemplifies  a 
backtrack-free,  incremental  recognition  process,  as  described  in  Chapter  5.  Our 
"sticky"  incremental  recognition  theory  is  a  very  crude  approximation  to  Litman’s. 

A  significant  difference  between  her  work  and  ours  is  our  treatment  of  disjunc¬ 
tion.  Our  theory  justifies  inference  through  disjunctions,  allowing  disjunctions  to  mul¬ 
tiply7  in  the  final  conclusion.  Litman  argues  that  discourse  should  provide  enough  lin- 

7 Although  in  practice  disjunctions  often  collapse  again  by  abstraction,  as  will  be  seen. 
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guistic  clues  such  as  intonation,  gesture,  and  keywords  to  eliminate  most  disjunctions. 
Following  [Sidner  85],  Litman  claims  that  if  the  interpretation  remains  ambiguous,  one 
should  halt  inference  and  simply  wait  fir  further  utterances. 


13.2.3.  Cohen  and  Levesque 

Recent  work  reported  in  [Cohen  &  Levesque  80,  Cohen  &  Levesque  87]  tries 
to  derive  speech  act  and  discourse  theory  from  general  principles  of  rational  interaction. 
The  work  axiomatizes  (part  of)  the  space  of  possible  inferences  available  to  an  agent 
engaged  in  planning  and  plan  recognition.  They  have  not  dealt  with  selecting  between 
alternative  interpretations  of  an  observation. 

Cohen  &  Levesque  developed  a  version  of  dynamic  logic  [Harel  79]  enriched 
with  a  logic  of  belief  [Hintikka  62]  to  represent  actions,  beliefs,  and  intentions. 
Axioms  in  this  language  formally  encode  rules  similar  to  those  of  Allen.  For  example, 
(one  version  of)  the  shared-recognition  precondition/action  axiom  is 

(imply 

(BMByx 

(and  (or  (CAUSE  xp  (CANxq)) 

(CAUSExp  (CAN  y  q))) 

(EXPECT  y  (GOAL  xq)) 

— <(GOAL  y  — <q) 

(HELPFUL  y  x))) 

(CAUSE  x 

(6MB  y  x  (GOAL  x  (GOAL  y  p))) 

(BMB  y  x  (GOAL  x 

(GOAL  y  (FINITELY -W AIT-FOR  x  q)))) 

) 

) 

The  formula  (BMB  y  x  ... )  means  that  y  believes  that  it  is  mutually  believed  between 
x  and  y  that . . .  The  axiom  states  that  if  y  and  x  mutually  believe  that  p  enables  q,  and  y 
expects  that  x  will  eventually  want  q,  and  y  doesn't  want  not-q,  and  y  is  helpful  to  x, 
then  whatever  x  does  to  make  it  mutually  believed  that  x  wants  y  to  want  p,  will  also 
make  it  mutually  believed  that  x  wants  y  to  achieve  q  in  the  future  (that  is,  x  won't  have 
to  wait  forever  for  q  to  occur). 

Cohen  &  Levesque  have  concentrated  on  getting  all  the  details  of  the  represen¬ 
tation  of  mental  attitudes  just  right,  and  have  deliberately  ignored  the  issue  of  control¬ 
ling  inference.  Our  own  work  has  gone  to  the  other  extreme,  and  completely  ignored 
the  representation  of  belief.  Indeed,  we  have  not  distinguished  between  what  the  agent 
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wants  to  come  about  and  what  actually  will  come  about;  we've  assumed  all  plans  are 
successful.  On  the  other  hand,  our  system  tells  us  what  conclusions  are  justified, 
rather  than  merely  what  conc  isions  are  possible .  (It  is  not  entirely  clear,  in  any  case, 
what  a  "possible"  conclusion  is!)  A  sophisticated  plan  recognition  system  will  eventu¬ 
ally  have  to  deal  with  the  kinds  of  representational  issues  raised  by  Cohen  &  Levesque. 

1.3.3.  Intelligent  Computer  Environments 

A  natural  application  area  for  the  discourse  systems  discussed  above  is  the  hu¬ 
man/computer  interface  itself.  Plan  recognition  is  a  central  component  of  several  pro¬ 
grams  of  research  aimed  at  creating  automated  consultants,  systems  which  would  help  a 
person  use  a  particular,  complicated  program,  or  perhaps  an  entire  operating  system. 

1.3. 3.1.  The  MACSYMA  Advisor 

One  of  the  earliest  automated  consultants  [Genesereth  79]  helped  people  use 
MACSYMA,  a  powerful  program  for  manipulating  symbolic  equations.  Genesereth 
created  a  model,  MUSER,  of  how  a  user  typically  breaks  down  a  task  when  using 
MACSYMA.  This  model  related  the  task,  or  plan,  structure  to  the  structure  of  the  for¬ 
mulas  being  manipulated.  Plans  were  represented  as  procedural  nets  [Sacerdoti  77], 
together  with  input/output  links  between  various  steps.  The  library  contained  both 
common  plans  and  common  mistakes. 

When  a  user  had  a  problem  with  MACSYMA,  he  would  invi  e  the  advisor, 
and  tell  it  both  his  intended  goal  and  what  he  had  actually  done.  The  advisor  then  built 
a  possibly  "buggy"  plan  graph  which  connected  the  two.  The  advisor  then  debugged 
the  plan  and  told  the  user  what  to  do. 

The  advisor  used  an  ordered  set  of  plan  recognition  rules,  which  are  very  simi¬ 
lar  to  those  used  by  Allen  and  Litman.  The  rules  applied  deterministically;  the  partial 
plan  was  expanded  only  in  unambiguous  cases.  An  "escape  hatch"  existed  in  that  the 
advisor  would  ask  the  user  for  clarification  in  case  of  ambiguity.  Genesereth's  work 
raised  many  issues  and  techniques  which  were  developed  (or  rediscovered)  by  later  re¬ 
searchers.  Like  Litman 's  system,  the  consultant  provided  a  "limited"  inference  mecha¬ 
nism,  and  could  not  just  chain  off  in  arbitrary  directions.  He  did  not,  however,  for¬ 
malize  the  principles  above  (except  in  a  particular  implementation  in  LISP),  or  deal 
with  multiple  expectations  or  concurrent  multiple  plans  -  issues  central  to  our  own 
work. 
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1.3. 3. 2.  A  Smart  Operating  System:  Plan  Parsing 


An  operating  system  consultant  under  development  at  the  University  of 
Massachusetts  [Huff  &  Lesser  82]  is  notable  for  dealing  with  multiple  concurrent 
plans,  and  for  relating  plan  recognition  to  parsing. 

The  system  tracks  user's  actions,  and  allows  users  to  specify  high-level  com¬ 
mands  which  are  disambiguated  by  context  A  library  of  operating-system  level  tasks, 
such  as  compile  or  edit,  is  encoded  by  a  set  of  grammatical  rewrite  rules,  together 
with  a  list  of  constraints.  The  rules  employ  an  extended  version  of  regular  expres¬ 
sions.8  For  example,  the  plan  to  update  source  code  is  partly  described  by  the  rewrite 
rule: 


updatesourceunit  => 

((edit  compile  check_results)  | 

(edit  compile)  | 

(compile  check_results)  | 

(compile))+ 

Since  a  programmer  may  be  working  on  several  different  projects  during  the  same  ses¬ 
sion,  the  notion  of  a  regular  grammar  is  extended  to  that  of  a  shuffle  grammar.  If  e  and 
f  are  expressions,  their  shuffle,  written  e$f,  is  the  set  of  strings  con  structed  by  mixing 
together  a  string  of  e  with  a  string  of  f.  The  interleave  of  an  expression  e,  written  e@, 
is  the  expression  shuffled  with  itself,  an  arbitrary  number  of  times.  For  example,  the 
fact  that  several  unrelated  programs  may  be  worked  on  simultaneously  is  represented 
by  the  grammar  rule 

programmingwork  => 

(do_programming  | 
do_documentation  | 
make_errors)@ 

The  intelligent  interface  tries  to  pane  the  (partial)  input  of  the  user  as  it  is  received.  It 
employs  heuristics  for  ordering  alternative  partial  interpretations  of  an  observation 
when  parsing.  The  heuristics  try  to  "minimize"  the  amount  of  mixing  performed  by  the 
shuffle  operator,  and  the  number  of  shuffles  invoked  by  the  interleave  operator.  For 
example,  it  prefers  the  shuffle  eeeeffff  over  eeffeeff,  which  is  preferred  over 


8 A  regular  expression  is  a  string  made  up  terminal  and  non-terminal  symbols  connected  by  the 
operators  |  (or),  *  (repeat  0  or  more  times),  +  (repeat  l  or  more  times),  and  blank  (concatenation), 
possibly  grouped  by  parentheses.  A  rewrite  rule  specifies  that  a  non-terminal  symbol  on  its  left-hand 
side  may  be  replaced  by  the  expression  on  its  right. 
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efefefef.  Likewise  for  interleave,  s  is  preferred  over  s$s,  which  is  preferred  over 
s$s$s.  The  heuristics  include: 

1.  Prefer  (linking  an  observation  to)  an  existing  plan  instantiation  over 

creating  a  new  instantiation. 

2.  Prefer  a  new  related  instantiation  to  a  new  unrelated  one. 

3.  If  alternative  interpretations  both  appear  in  the  same  higher-level 

containing  plan,  prefer  the  interpretation  which  appears  first  in 
that  higher-level  plan. 

Huff  &  Lesser's  parsing  heuristics  yield  conclusions  similar  to  those  obtained 
by  the  incremental  recognition  theories  discussed  in  Chapter  5.  The  plan  recognition 
algorithms  described  in  Chapter  7  have  a  strong  flavor  of  parsing,  despite  their  origins 
in  logical  inference.  Just  as  parsing  can  be  viewed  as  logical  inference9,  specialized 
inference  can  be  cast  as  parsing.  But  there  are  at  least  two  crucial  differences  between 
our  work  and  Huff  &  Lesser's.  First,  we  treat  all  temporal  orderings  between  the 
components  of  an  event  as  constraints.  Any  temporal  relation  may  be  specified:  for 
example,  one  step  may  be  during  another,  times  may  overlap,  or  one  step  may  be  re¬ 
quired  to  occur  either  before  or  after  another  (without  specifying  which  alternative). 
Second,  our  algorithms  correspond  to  a  particular  model  theory,  while  Huff  &  Lesser's 
are  admittedly  ad  hoc.  The  inclusion  of  the  shuffle  operator,  and  the  need  to  reach 
conclusions  before  the  end  of  a  sequence  of  observations,  prevents  Huff  &  Lesser  from 
using  any  standard,  well- understood  parsing  algorithms. 

1 .4.  Related  Work  on  Medical  Diagnosis 

One  of  the  most  active  areas  of  applied  research  in  A.I.  has  been  in  the  area  of 
medical  diagnosis.  Diagnosis  and  plan  recognition  are  similar  kinds  of  high-level 
recognition  problems.  (In  fact,  an  early  paper  by  [Pople  73],  who  was  later  to  create 
the  INTERNIST  diagnostic  system,  explicitly  made  the  connection.)  In  both  cases  one 
needs  to  find  the  best  explanation  for  some  phenomenon,  using  a  hierarchically  struc¬ 
tured  body  of  domain-specific  knowledge.  The  vocabulary  of  events  can  be  mapped  to 
one  appropriate  for  diagnosis  in  a  straightforward  way.  (See  Chapter  2  for  the  details 
of  the  Event  vocabulary.)  Events  are  replaced  by  pathological  states  of  a  patient.  An 
abstraction  hierarchy  over  pathological  states  is  known  as  a  nosology.  The 


9See  any  of  the  work  on  parsing  with  Prolog,  such  as  [Pereira  &  Warren  80). 
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dealt  with  this  issue,  yet  it  is  critical  if  we  are  to  build  systems  which  can  actively  en¬ 
gage  in  a  discourse  or  other  task  involving  plan  recognition. 

1.4.1.  INTERNIST  and  CADUCEUS 

One  of  the  best-known  diagnostic  systems  is  INTERNIST,  an  ambitious  pro¬ 
gram  which  aimed  to  cover  almost  the  entire  field  of  internal  medicine.  The  first  ver¬ 
sion  of  INTERIST  employed  a  causal  hierarchy,  and  tried  to  chain  from  a  group  of  ob¬ 
served  symptoms  to  one  or  more  specific  disease  entities.  The  succeeding  versions  of 
the  program,  INTERNIST-II  and  ultimately  CADUCEUS,  employed  multiple  abstrac¬ 
tion  (nosology)  hierarchies  as  well.  A  detailed  description  of  the  project  appears  in 
[Pople  82]. 

A  task  for  CADUCEUS  was  a  data  structure  that  explained  a  set  of  observations 
by  causally  linking  them  to  one  or  more  pathological  states.  The  pathological  states  did 
not  have  to  be  ultimate,  specific  disease  entities;  they  could  be  any  states  within  the  tax¬ 
onomy.  The  task  could  contain  certain  kinds  of  disjunctions.  For  example,  a  task 
could  link  symptom  A  to  causes  B,  C,  and  D,  meaning  that  either  B  caused  A,  or  C 
caused  A,  or  D  caused  A.  Such  a  disjunction  was  called  a  differential  diagnosis  set. 
The  differential  diagnosis  sets  could  collapse  within  the  task.  For  example,  all  of  B,  C, 
and  D  could  be  linked  by  a  subclassification  arc  to  a  more  general  disease  type  E. 

CADUCEUS  created  a  task  for  each  observed  symptom,  and  then  performed  a 
heuristic  search  to  attempt  to  combine  the  tasks  into  a  smaller  set  During  the  search, 
CADUCEUS  could  interactively  prompt  for  information  that  could  would  be  used  to 
reduce  differential  diagnosis  sets.  An  example  of  a  operator  that  CADUCEUS  could 
apply  to  two  tasks  in  order  to  combine  them  is  Combined  Causal! Subclassification :10 

Where  P  and  Q  are  descriptors  in  tasks  Tj  and  T2  respectively: 

IF  P  is  (directly)  caused  by  Si  or  S2  or ...  Sn  AND 

Si,  S2, ...  Sn  all  (directly)  sub-classify  Q  THEN 

Link  Ti  and  T2  by  adding  caused-by  arcs  from  P 

to  each  of  Si,  S2, ...  Sn,  and  sub-classification  arcs 
from  each  of  Si,  S2, ...  Sn  to  Q 

In  Chapter  7,  our  algorithm  for  plan  recognition  employs  a  data  structure  called 
an  ”e-graph",  which  links  a  set  of  observed  events  to  some  End  event  E-graphs  are 


10The  notation  here  is  our  own,  not  Pople's. 


7A-D-  28 


decomposition  hierarchy  corresponds  to  a  causation  hierarchy.  If  pathological  state  A 
always  causes  pathological  state  B,  then  B  acts  as  a  component  of  A.  If  only  certain 
cases  of  A  cause  B,  then  one  can  introduce  a  specialization  of  A  that  has  component  B. 
The  most  basic  specializations  of  End  (unexplainable)  events  correspond  to  specific 
disease  entities ,  while  states  which  can  be  directly  observed  are  symptoms.  (Note  that 
a  symptom  may  also  cause  other  other  states:  e.g.,  high  blood  pressure  can  be  directly 
measured,  and  it  can  cause  a  heart  attack.) 

The  pattern  of  inference  in  plan  recognition  and  diagnosis  is  similar  as  well. 
Each  symptom  invokes  a  number  of  different  diseases  to  consider,  just  as  each  ob¬ 
served  event  in  our  framework  c-en tails  the  disjunction  of  its  uses.  Once  several  find¬ 
ings  are  obtained,  the  diagnostician  attempts  to  find  a  small  set  of  diseases  which  ac¬ 
counts  for,  or  covers,  all  the  findings.  This  step  corresponds  to  the  minimization  of 
End  events  in  mc-entailment  A  general  medical  diagnosis  system  must  deal  with  pa¬ 
tients  suffering  from  multiple  diseases;  our  plan  recognition  framework  was  designed 
to  account  for  multiple  concurrendy  executing  plans.  Finally,  our  work  departs  from 
previous  work  in  plan  recognition  by  explicidy  dealing  with  disjunctive  conclusions, 
which  are  winnowed  down  by  obtaining  more  observations.  These  disjunctive  sets 
correspond  to  the  differential  diagnosis  sets  which  play  a  central  role  in  medical  rea¬ 
soning. 


There  are  some  significant  differences  between  our  framework  and  those  used 
in  medical  expert  systems.  In  plan  recognition  it  is  necessary  to  distinguish  the  differ¬ 
ent  roles  that  one  event  could  play  as  a  component  of  another,  because  a  plan  may  have 
several  different  steps  of  the  same  type.  One  would  need  to  distinguish  the  instance  of 
the  "meet  with  committee"  event  which  initiates  the  plan  to  "obtain  PhD  degree"  from 
the  one  which  terminates  it  Medical  systems  simply  have  undifferentiated  "causes" 
links  from  diseases  to  symptoms.  They  make  the  assumption  that  a  disease  can't  cause 
two  different  instances  of  the  same  symptom.  In  addition,  the  role  of  time  is  critical  in 
plan  recognition,  but  usually  ignored  in  medical  systems.  But  future  medical  systems 
may  need  to  eliminate  both  differences.  It  certainly  could  be  the  case  that  a  pathological 
state  causes  two  different  instances  of  the  same  abstract  symptom.  Thus  AIDS  often 
causes  multiple  different  kinds  of  cancer  in  the  same  patient  [Patil,  Szolovits,  & 
Schwartz  82]  suggests  that  diagnostic  systems  should  incoiporate  a  model  of  the  time 
course  of  a  disease,  and  the  difference  symptoms  which  are  manifest  at  each  stage. 

Much  of  the  research  effort  in  medical  expert  systems  has  been  on  discovery 
procedures.  The  diagnostic  system  must  take  an  active  role,  asking  questions  in  order 
to  narrow  down  the  set  of  possible  diseases.  None  of  the  work  in  plan  recognition  has 
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similar  in  structure  to  CADUCEUS's  tasks.  But  because  our  e-graphs  always  contain 
an  End  node,  two  e-graphs  can  be  combined  by  simply  performing  a  top-down  match 
starting  at  the  End  nodes.  The  resulting  e-graph  encodes  all  possibly  ways  of  com¬ 
bining  the  original  graphs.  CADUCEUS  performs  only  a  heuristic  search,  and  there¬ 
fore  could  not  guarantee  completeness.  There  is  no  straightforward  way  to  tell  when  all 
possible  combinations  have  been  found,  or  whether  the  heuristics  need  to  run  longer. 
The  particular  set  of  combining  operators  are  (Pople  admits)  rather  arbitrary  and 
incomplete. 

An  interesting  feature  incorporated  in  CADUCEUS  that  our  framework  lacks 
are  planning  links,  which  connect  one  state  to  another  if  some  specialization  of  the  latter 
can  cause  the  former.  [Pople  82]  shows  how  such  links  can  be  very  useful  for  heuris- 
tically  guiding  search. 

1.4.2.  A  Set  Covering  Model 

Recent  work  by  [Reggia,  Nau,  &  Wang  83]  has  proposed  that  medical  diagno¬ 
sis  can  be  viewed  as  a  set  covering  problem.  Each  disease  corresponds  to  the  set  of  its 
symptoms,  and  the  diagnostic  task  is  to  find  a  minimum  cover  of  a  set  of  observed 
symptoms. 

While  this  idea  has  long  been  implicit  in  diagnostic  systems  such  as 
INTERNIST,  Reggia  &  Nau's  work  is  unique  in  providing  a  provably  correct  and 
complete  algorithm  which  not  only  finds  all  minimum  covers,  but  also  indicates  when 
and  how  the  diagnostician  should  request  additional  findings.  The  system  is  particu¬ 
larly  strong  in  dealing  with  multiple  simultaneous  diseases. 

Reggia  &  Nau  have  not  yet  expanded  the  work  to  cover  many-level  causal  or 
abstraction  hierarchies,  although  they  mention  that  as  a  topic  for  future  research.  The 
lack  of  abstraction  lets  them  stay  within  a  purely  propositional  framework,  which 
greatly  simplifies  the  algorithms.  Once  causal  and  abstraction  hierarchies  are  added, 
their  framework  and  algorithms  may  prove  to  be  very  similar  to  our  own. 

1.5.  Related  Work  on  Non-Deductive  Inference 

Most  recognition  problems  cannot  be  formalized  entirely  within  a  deductive 
system  -  unless  one  uses  it  as  a  meta-language  to  talk  about  a  different,  non-deductive 
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system.11  This  section  reviews  the  major  formal  systems  of  non-deductive  inference 
used  within  A.L,  concluding  with  circumscription,  the  system  most  closely  related  to 
our  own  work. 


1.5.1.  Probability  Theory 

Probability  theory  provides  the  basis  for  non-deductive  inference  in  practically 
all  the  sciences  outside  of  A.L.  Its  success  should  at  least  cause  the  computer  scientist 
to  hesitate  a  bit  before  devising  a  new  sort  of  theory.  Indeed,  a  growing  number  of  re¬ 
searchers  are  basing  systems  for  such  tasks  as  vision  or  medical  diagnosis  on  classical, 
Bayesian,  or  Dempster-Shafer12  theories  of  probability.  In  this  section  we  will  de¬ 
scribe  (part  of)  plan  recognition  in  terms  of  very  elementary  statistics.  Our  own  work 
in  plan  recognition,  and  much  of  that  cited  above,  can  be  viewed  a  method  of  perform¬ 
ing  certain  steps  in  the  probabilistic  inference.  A  logical  theory  of  plan  recognition  also 
addresses  issues  about  which  classical  probability  theory  has  little  to  say:  most  impor¬ 
tantly,  how  hypotheses  should  be  constructed,  and  when  a  likely  statement  should  be 
accepted  as  fact 

Let  Aj,  A2,  etc.  be  various  directly-observable  and  executable  actions.  A 
countably  infinite  number  of  plans,  labeled  Pi,  P2,  etc.  can  be  constructed  from  these 
actions.  Each  basic  hypothesis  Bi  is  of  the  form,  "the  agent  intends  to  perform  plan 
Pi."  A  composite  hypothesis  H  is  a  boolean  combination  of  basic  hypotheses.  For 
now,  we  will  only  consider  composite  hypotheses  which  are  conjunctions  of  basic 
hypotheses. 

What  is  the  general  form  of  a  plan  recognition  problem?  One  might  try  to 
determine  the  most  likely  composite  hypothesis.  There  is  only  a  trivial  answer  to  this 
problem:  the  empty  hypothesis,  that  makes  no  assertion  about  the  world.  We  cannot 
state  a  plan  recognition  problem  as  a  crisp  decision  problem.  The  best  one  can  do  is  to 
try  to  determine  the  sets  of  plans  (composite  hypotheses)  which  are  "most  likely"  on  the 
basis  of  the  observed  actions.  (Later  on  we’ll  mention  some  of  the  problems  in 
determining  the  exact  point  at  which  a  hypothesis  is  likely  enough  to  be  accepted.) 

Where  P  is  the  probability  function,  and  A  is  the  set  of  observed  actions,  the 
goal  is  find  those  H  for  which  P(H  |  A)  is  large.  By  Bayes'  theorem. 


11  For  example,  [Kyburg  74]  uses  first-order  logic  as  a  meta-language  to  axiomatize  a  system  of 
probabilistic  inference. 

12(Dempster  &  Shafer  76] 
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- .  .  P(A  I  H)  P(H) 

P(H  I  A) - 

Given  a  fixed  set  of  observations  A  and  certain  simplifying  assumptions,  it  is  possible 
to  determine  the  relative  probabilities  of  various  candidate  hypotheses.  Assume  that 
every  plan  (every  Bi)  has  the  same  prior  probability,  and  exactly  one  possible 
decomposition  (that  is,  a  breakdown  into  actions).  Begin  by  considering  the  case 
where  the  actor  performs  exactly  one  plan  -  that  is,  the  Bi  are  disjoint.  When 
calculating  relative  probabilities,  the  factors  P(A)  and  P(H)  can  be  ignored;  the  first 
because  A  is  fixed,  the  second  because  every  candidate  H  must  be  a  basic  hypothesis. 
The  conditional  probability  of  the  observations  given  the  hypothesis  is  simply: 

P(A  |  H)  =  1  if  H  a  Bi  a  (V  Aj  e  A)  .  component-of(Aj,  Pi) 

=  0  otherwise 

So  the  most  likely  hypotheses  are  simply  those  which  incorporate  all  the  observations. 
But  since  plans  are  parameterized,  there  are  an  infinite  number  of  candidate  H's  to 
consider.  We  need  some  way  of  searching  this  huge  space.  Classical  statistics  seems 
to  have  little  help  to  offer;  statistical  theory  has  concentrated  on  the  special  case  where 
the  various  H  are  of  the  same  known  form  (e.g.  the  percentage  of  red  balls  in  the  um) 
parameterized  only  by  some  numerical  coefficients.  But  here  we  must  deal  with  a  large 
number  of  fundamentally  different  plans  which  take  different  discrete  parameters.  (The 
minimal  model  construction  described  in  this  thesis  corresponds  to  this  search.) 

Relax  the  assumption  that  the  Bj  are  disjoint  Now  it  is  harder  to  compute 
relative  values  for  P(H  |  A).  First,  it  is  no  longer  reasonable  to  assign  the  same  value 
to  all  P(H)  -  particularly  since  some  composite  hypotheses  may  strictly  entail  other 
composite  hypotheses.  Second,  the  conditional  probability  of  the  observed  actions 
given  the  candidate  hypothesis  is  much  less  constrained.  Without  making  additional 
assumptions,  all  one  may  conclude  is  that 

P(A  |  H)  =s  1  if  (V  Aj  €  A)  (3  Bj ).  H  |-  Bi A  component-of(Aj,  Pi) 

<  1  otherwise 

If  one  is  comfortable  assuming  that  the  Bj  are  independent,  so  that 
P(Bi  a  B2  a  ...  a  Bn)  =  P(BO  P(B2) ...  P(B„) 
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then  note  the  following:  if  hypotheses  H  and  H'  contain  the  same  number  of 
conjuncts,  and  if  H  accounts  for  all  of  A  but  H'  does  not,  then  P(H  I  A)  > 
P(H’  |  A).  Furthermore,  if  H  and  H'  both  account  for  A,  the  smaller  hypothesis  has 
the  higher  probability.  Our  framework  for  plan  recognition  has  similar  characteristics. 

We  have  suggested  that  in  plan  recognition  it  is  appropriate  to  invoke  Occam's 
razor  to  prefer  explanations  which  only  require  the  actor  to  have  as  few  unrelated  in¬ 
tentions  at  a  time  as  possible.  This  condition  does  not  quite  fall  out  of  the  probabilistic 
account  so  far,  even  assuming  the  independence  of  the  basic  hypotheses:  it  may  give  a 
high  probability  to  an  H  which  does  not  quite  account  for  all  of  A,  but  would  have  to 
be  greatly  expanded  to  account  for  the  remainder.  This  can  be  partially  remedied  by 
constraining  the  conditional  probability  function  P(A  |  H)  to  be  very  small  for  such  an 
H. 


P(A  |  H)  «P(B0 

if  (3  Aj  e  A)  (V  Bi ).  H  J-  Bi  D  -i  component-of(Aj,  Pj) 

A  further  complication  awaits  in  the  wings:  it  is  not  adequate  to  only  consider 
conjunctive  composite  hypotheses.  Much  of  this  thesis  will  be  concerned  with  abstract 
event  (plan)  types.  An  abstract  plan  stands  for  the  class  of  its  specializations;  that  is,  it 
is  logically  equivalent  to  the  disjunction  of  all  the  different  ways  it  which  it  can  be  fully 
decomposed  with  constants  assigned  to  all  its  parameters.  The  likelihood  that  the  agent 
is  performing  an  abstract  plan  is  the  probability  assigned  to  the  corresponding 
composite  hypothesis,  which  may  be  equivalent  to  an  arbitrary  boolean  combination  of 
basic  hypotheses.  This  more  general  formulation  compounds  the  size  of  space  of 
hypotheses. 

In  the  end,  then,  probability  gives  a  way  to  compare  the  relative  merits  of  alter¬ 
native  hypotheses  about  an  actors  intentions,  but  does  not  automatically  yield  a  way  of 
generating  or  selecting  hypotheses  to  compare.  We  don't  seem  to  need  any  of  the 
high-powered  mathematical  techniques  statisticians  have  developed.  And  still  the 
problem  of  deciding  what  likely  statements  to  accept  as  true  remains. 

In  our  model-theoretic  treatment  of  plan  recognition,  the  statements  that  the  ob¬ 
server  should  accept  are  simply  those  which  are  valid  in  the  minimal  model  construc¬ 
tion.  Using  probabilities,  one  needs  some  sort  of  rule  of  acceptance,  or  at  the  least  be 
prepared  to  deal  with  all  the  complexities  of  allowing  an  agent  to  hold  degrees  of  belief. 
[Kyburg  74]  deals  in  detail  with  the  complexities  of  both  these  problems.  The  present 
situation  appears  particularly  difficult,  because  we  have  not  come  up  with  absolute 
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probabilities,  but  only  a  method  for  ordering  the  relative  probabilities  of  certain  state¬ 
ments.  Thus  no  simple  rule  —  such  as  “accept  H  if  P(H)  >  .95”  will  do. 


1.5.2.  Default  Logic 

Much  everyday  reasoning  employs  "default  rules",  that  allow  conclusions  to  be 
drawn  if  there  is  no  evidence  to  the  contrary.  For  example,  if  all  one  knows  about 
Tweety  is  that  he  is  a  bird,  then  one  may  conclude  by  default  that  Tweety  can  fly.  The 
additional  information  that  Tweety  is  a  penguin  and  that  penguins  don't  fly  would 
disallow  the  previous  conclusion.  [Reiter  80]  developed  an  extension  to  first-order 
logic,  called  default  logic ,  to  handle  this  kind  of  inference. 

How  useful  are  default  rules  for  plan  recognition?  It  is  straightforward  to  cast, 
for  example,  Allen's  plan  recognition  rules  as  default  inference  rules.  Using  Reiter's 
logic,  one  could  write,  for  example,  the  preconditionJ action  rule  as: 

Want(agt,  P)  a  precondition-of(P,  act)  :  M  Want(agt,  act) 

Wantfagt,  act) 

This  says  that  if  an  agent  wants  a  precondition  of  an  act  to  hold,  and  it  is  consistent  that 
(M)  he  wants  the  act,  then  conclude  that  he  wants  the  act  But  little  has  been  gained 
from  the  use  of  default  logic.  Default  logic  has  the  property  that  given  a  set  of  facts  and 
set  of  rules,  one  may  reach  different  and  perhaps  mutually  contradictory  sets  of  conclu¬ 
sions,  depending  on  the  order  in  which  one  applies  the  default  rules.  The  logic  insures 
that  each  set  of  conclusions,  or  extension,  is  internally  consistent  but  gives  no  way 
choosing  between  them.  As  a  basis  for  plan  recognition,  then,  default  logic  suffers  the 
same  criticisms  as  the  dynamic  logic  framework  of  Cohen  &  Levesque:  too  much  of 
the  problem  remains  hidden  in  the  strategy  which  orders  the  application  of  various  rules 
of  inference. 

1.5.3.  Circumscription 

Circumscription  is  a  specialized  form  of  non-monotonic  inference  developed  by 
McCarthy  to  handle  the  "qualification  problem"  in  planning.  McCarthy  wanted  to  be 
able  to  formally  state  that  "the  objects  that  can  be  shown  to  have  a  certain  property  P  by 
reasoning  from  certain  facts  A  are  all  the  objects  that  satisfy  P."  [McCarthy  80]  For 
example,  one  might  have  a  description  of  a  bunch  of  blocks  on  a  table,  and  want  to 
synthesize  a  plan  to  build  a  certain  kind  of  tower.  It  might  be  necessary  to  pick  up  a 
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block  B,  which  can  only  be  performed  if  B  is  clear.  If  one  cannot  prove  that  there  is 
anything  on  top  of  B,  then,  in  general,  one  wants  to  be  able  to  conclude,  by  circum¬ 
scribing  the  predicate  on,  that  nothing  is  on  top  of  B. 

The  circumscription  of  a  predicate  (or  set  of  predicates)  relative  to  a  body  of 
knowledge  is  a  sentence  of  second-order  logic  which  involves  the  entire  collection  of 
facts  at  hand  The  following  formulation  is  drawn  from  [Lifschitz  84].  Let  S[rc]  be  a 
sentence  containing  the  list  of  predicates  jc.  The  expression  S[a]  is  the  sentence  ob¬ 
tained  by  rewriting  S  with  each  member  of  k  replaced  by  the  corresponding  member  of 
a.  The  expression  a  £  x  abbreviates  the  formula  stating  that  the  extension  of  each 
predicate  in  a  is  a  subset  of  the  extension  of  the  corresponding  predicate  in  7t;  that  is 

(Vx  .  <Ti(x)  3  tci(x))  a  ...  a  (Vx  .  an(x)  3  xn(x)) 

where  each  x  is  a  list  of  variables  of  the  proper  arity  to  serve  as  arguments  to  each  <?i. 
The  circumscription  of  jc  relative  to  S,  written  Circum(S[x],7t),  is  the  second-order 
formula 

S[x]  a  V  o  .  (S[o]  Aa£jt)DJt£a 

Chapter  4  shows  how  this  formula  is  generalized  to  allow  other  symbols  to  vary  during 
the  minimization,  as  described  in  [McCarthy  84]. 

Consider  the  following  example.  Let  S[p]  =  p(A)  v  p(B),  and  compute  the 
circumscription  of  p  relative  to  S.  This  is: 

(p(A)  v  p(B))  a  V  a  .  ((a(  A)  v  o(B))  AOSp)3p$a 
Expanding  the  £  abbreviation  gives 

(p(A)  v  p(B))  a 

V  <j .  ((<y(A)  v  a(B))  a  Vx  .  a(x)  3  p(x))  3 
Vx .  p(x)  3  o(x) 

Now  let  us  investigate  what  can  be  concluded  from  this  statement.  Instantiate  the 
predicate  variable  a  with  the  lambda-expression  \(x) .  x=A.  (That  is,  a  is  the  predicate 
which  is  true  of  A  and  nothing  els2.)  This  yields 
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(p(A)  v  p(B))  a 

((A=A)  v  A=B))  a  Vx  .  x=A  3  p(x))  3 
Vx  .  p(x)  3  x=A 

This  statement  reduces  to 

(p(A)  v  p(B))  a 
p(A)  3  (Vx .  p(x)  3  x=A) 

Instantiate  the  circumscription  formula  a  second  time  with  the  expression  X.(x).x=B. 
The  result  is 

(p(A)  v  p(B))  a 
p(B)  3  (Vx  .  p(x)  3  x=B) 

Thus  the  final  conclusion  is  that  one  of  A  or  B  is  a  p,  and  there  is  only  one  p  thing. 
(The  conclusion  is  not  p(A)  exclusive-or  p(B)  because  of  the  possibility  that  A=B.) 

(Vx  .  p(x)  3  x=A)  v  (Vx  .  p(x)  3  x=B) 

This  example  illustrates  the  major  stumbling  block  to  the  use  of  circumscription. 
There  is  no  general  mechanical  way  of  determining  how  to  instantiate  the  predicate  pa¬ 
rameters  in  the  second-order  formula.  The  work  in  this  thesis  overcomes  this  problem 
for  a  particular  class  of  circumscriptions,  and  gives  simple  syntactic  rules  for  comput¬ 
ing  a  first-order  version  of  the  circumscription  formula. 

A  model  of  a  set  of  statements  is  minimal  in  n  if  there  is  no  other  model  of 
those  statements  which  is  identical,  except  that  7t  holds  of  some  things  in  the  first  model 
but  not  in  the  second.13  Section  3.1  formally  defines  this  notion.  [Etherington  86]  in¬ 
cludes  proofs  that  the  circumscription  of  a  predicate  relative  to  a  formula  is  true  in  all 
models  minimal  in  the  predicate.  The  story  of  the  completeness  of  circumscription  rel¬ 
ative  to  this  model  theory  is  a  bit  complicated.  The  notion  of  a  minimal  model  is  pow¬ 
erful  enough  to  capture  the  standard  model  of  arithmetic,  which  cannot  be  axiomatized 
[Davis  80].  While  all  strong  second-order  models  of  the  circumscription  formula  are 


l3McCarthy's  original  version  of  circumscription  did  not  make  reference  to  a  particular  set  of  predicates 
to  be  minimized.  Instead,  the  entire  domain  of  the  model  was  minimized;  thus,  a  minimal  model  of 
formula  was  one  which  had  no  submodels  of  the  formula.  [Etherington  86]  calls  this  version  of 
circumscription  domain  circumscription,  and  has  shown  that  it  is  not  comparable  in  expressive  power 
to  predicate  circumscription.  In  this  thesis,  the  expression  "minimal  model"  always  means  "minimal 
in  some  predicate". 
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minimal  models,  it  is  not  necessarily  the  case  that  all  weak  second-order  models  are 
minimal.  Another  way  of  stating  this  is  that  the  circumscription  formula  is  not  always 
complete  if  one  views  it  as  a  schema  for  recursively  generating  a  set  of  first-order 
statements.14  [Minker  &  Perlis  85]  show  that  circumscription  is  complete  in  this  sense 
if  in  all  models  the  circumscribed  predicate  has  a  finite  extension.  Chapter  3  presents  a 
proof  that  formulas  computed  by  our  closure  function  cl  are  complete  for  the  minimal 
model  semantics. 


14McCarthy's  early  versions  of  circumscription  viewed  the  formula  in  just  this  way,  as  a  first-order 
schema,  rather  than  as  a  tme  second-order  statement 
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Chapter  2 

Representing  Event  Hierarchies 

2.1.  Language 

The  representation  language  is  first-order  predicate  calculus  with  equality.  A 
model  provides  an  interpretation  of  the  language,  mapping  terms  to  individuals,  func¬ 
tions  to  mappings  from  tuples  of  individuals  to  individuals,  and  predicates  to  sets  of 
tuples  of  individuals.  If  M  is  a  model,  then  this  mapping  can  be  made  explicit  by 
applying  M  to  a  term,  function,  or  predicate.  For  example,  for  any  model  M: 

Loves(Sister(Joe)3ill)  is  true  in  M  if  and  only  if 

(M[Sister](M[Joe]),  M[BiU])  e  M[Loves] 

Meta-variables  (not  part  of  the  language)  which  stand  for  domain  individuals  begin  with 
a  colon.  Thus,  one  may  write: 

Let  :C  be  an  event  token  in  Domain (M). 

Models  map  free  variables  in  sentences  to  individuals.  We  write  M{x/:C}  to  mean  the 
model  which  is  just  like  M,  except  that  variable  x  is  mapped  to  individual  :C. 
Quantification  is  defined  as  follows: 

3x  .  p  is  true  in  M  if  and  only  if 

there  exists  :C  e  Domain (M)  such  that  p  is  true  in  M{x/:C} 

Vx .  p  is  true  in  M  if  and  only  if 
-i  3x .  -ip  is  true  in  M 

The  propositional  connectives  are  semantically  interpreted  in  the  usual  way. 

Certain  finite  models  can  be  completely  specified  by  listing  all  the  positive 
atoms  which  hold  in  the  model,  other  than  trivial  instances  of  equality  (i.e.,  of  the  form 
a=a).  For  example,  a  model  in  which  the  only  predicate  with  a  non-empty  extension  is 
Fat,  which  has  two  elements,  may  be  specified  by  writing: 

{  Fat(A),  Fat(B) } 

FOL  proofs  in  this  thesis  use  natural  deduction,  freely  appealing  to  obvious 
lemmas  and  transformations.  It  is  convenient  to  distinguish  a  set  of  constant  symbols 
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called  individual  parameters,  or  just  parameters,  for  use  in  the  deductive  rule  of  exis¬ 
tential  elimination.  The  rule  allows  one  to  replace  an  existentially-quantified  variable  by 
a  parameter  which  appears  at  no  earlier  point  in  the  proof.  Parameters  are  distinguished 
by  the  prefix  *.  Technically,  no  parameters  may  appear  in  the  final  step  of  the  proof: 
they  must  be  replaced  again  by  existentially-quantified  variables  (or  eliminated  by  other 
means).  This  final  step  is  omitted  when  it  is  obvious  how  it  should  be  done.  For  ex¬ 
ample,  we  may  write 

The  system  concludes  E1(*C)  v  E2(*C). 

instead  of 


The  system  concludes  3x  .  El(x)  v  E2(x). 

2.2.  Representation  of  Time,  Properties,  and  Events 

Most  formal  work  on  representing  action  has  relied  on  the  situation  calculus  or 
dynamic  logic  [Harel  79].  While  these  formalisms  are  convenient  for  planning,  they 
prove  awkward  for  plan  recognition:  it  is  impossible  (without  extreme  convolutions; 
see  [Cohen  84])  to  state  that  some  particular  action  actually  occurred  at  a  particular  time. 
We  therefore  adopt  a  "reified"  representation  of  time  and  events. 

Time  is  linear,  and  time  intervals  are  individuals,  each  pair  related  by  one  of 
Allen's  interval  logic  relations:  Before,  Meets,  Overlaps,  etc.  [Allen  83b].  The  names 
of  several  relations  may  be  written  in  place  of  a  predicate,  in  order  to  stand  for  the  dis¬ 
junction  of  those  relations,  each  applied  to  the  same  argument  pair.  For  example,  the 
expression 


BeforeMeets(Tl,T2) 
is  an  abbreviation  for  the  formula 

Before(Tl,T2)  v  Meets(Tl,T2) 

As  a  special  case,  the  predicates  Within  and  Disjoint  abbreviate  common  disjunc¬ 
tions: 


Within(Tl,T2)  a 

Starts(Tl,T2)  v  During(Tl,T2)  v  Finishes(Tl,T2) 
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Disjoint(Tl,T2)  a 

Before(Tl,T2)  v  Meets(Tl,T2)  v  MetBy(Tl,T2)  v  After(Tl,T2) 

Event  tokens  are  also  individuals,  and  event  types  are  represented  by  unary 
predicates.  All  event  tokens  are  real;  there  are  no  imaginary  or  "possible"  event  tokens. 
Various  functions  on  event  tokens,  called  roles,  yield  parameters  of  the  event  Role 
functions  include  the  event's  agent  and  time.  For  example,  the  formula 

ReadBook(C)  a  agent(C)=Fred  a  time(C)=T2 

may  be  used  to  used  to  represent  the  fact  that  an  instance  of  booking  reading  occurs;  the 
agent  of  the  reading  is  Fred;  and  the  time  of  the  reading  is  (the  interval)  T2. 

The  reader  may  wonder  why  we  have  not  used  functions  to  construct  event  to¬ 
kens  from  their  parameters.  For  instance,  one  might  want  to  specify  the  above  action 
as: 


ReadBook(Frcd,  T2) 

Indeed,  many  researchers  have  taken  the  line  that  an  event  type,  together  with  its  pa¬ 
rameters  and  time  of  occurrence  completely  specifies  an  event  token  [Pollack  86,  Allen 
84].  There  arc  two  objections  to  this  alternative.  First  is  that  it  is  not  always  possible 
to  specify  all  the  parameters  of  an  event  beforehand.  Should  ReadBook  also  have  a 
parameter  for  the  thing  being  read?  What  about  whether  the  agent  is  skimming  or 
reading  carefully?  And  so  on.1  A  second  problem  occurs  when  we  allow  very  abstract 
event  types,  like  "ReadSomething",  which  holds  of  any  reading  event,  or 
"Gainlnformation",  which  holds  of  any  instance  of  reading,  watching  television,  lis¬ 
tening  to  the  radio,  etc.  It  is  possible  for  several  distinct  instances  of  the  same  abstract 
event  type  to  occur  simultaneously.  Thus  it  is  not  sufficient  to  only  represent  event 
types  and  times;  explicit  event  tokens  are  needed  as  welL 

Timeless  propositions  are  called  facts.  Facts  are  represented  by  ordinary  predi¬ 
cates.  For  example,  the  fact  that  John  is  a  human  may  be  represented  by  the  formula 

Human(John) 

The  predicate  Holds  relates  a  time-dependent  property  and  a  time  interval  over  which 
the  property  holds.  Properties  are  individuals,  and  are  therefore  represented  by  terms. 
For  example,  the  fact  that  John  is  unhappy  at  time  T1  may  be  represented  by  the  for- 

1Thii  is  the  same  problem  people  working  on  case  grammar  [Fillmore  68]  ran  into. 
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mula 


Holds(unhappy(John),Tl) 

The  term  unhappy  (John)  denotes  the  proposition  that  John  is  unhappy.  Every  distinct 
property  term  represents  a  distinct  property.  This  is  enforced  by  a  set  of  property  iden¬ 
tity  axioms .  The  axioms  take  the  following  form. 

For  all  distinct  p,a  e  Property  Terms: 

V  xi,...  xn  .  p(xi,...  xn)  *  o(xi,...  xn) 

V  xi,...  ,xn,  yi, ...  yn  •  p(xi,...  Xn)  —  p(yi,...  yn) 

xj*yi  a  ...  a  xn=yn 

All  properties  are  dense:  if  one  holds  over  an  interval,  then  it  holds  over  all  subinter¬ 
vals  of  that  interval.  The  predicate  Never  holds  of  a  property  and  a  time  when  the 
property  holds  over  no  subinterval  of  the  time.  The  following  axioms  capture  these 
constraints: 


Vp,ti,t2  •  Holds(p.ti)  a  Within(t2,ti)  3  Holds(p,t2) 

Vp.ti  .  Never(p,ti)  a  (Vt2  .  Holds(p,t2)  3  Disjoint^, t2)) 

In  addition  to  the  relations  between  individuals,  it  is  sometimes  convenient  to 
talk  about  relations  between  predicates.  These  meta-relations,  such  as  "abstracts"  and 
"component",  are  not  part  of  the  logic  itself. 

2.3.  The  Event  Hierarchy 

An  event  hierarchy  is  a  collection  of  restricted-form  axioms,  and  may  be  viewed 
as  a  logical  encoding  of  a  semantic  network,  as  in  [Hayes  85].  These  axioms  represent 
the  abstraction  and  decomposition  relations  between  event  types.  An  event  hierarchy  H 
contains  the  following  parts,  Hg,  H^,  Hgg,  Hp,  and  Hq: 

•Hg  is  the  set  of  unary  event  type  predicates.  Hg  contains  the  distinguished  predicates 
AnyEvent  and  End. 

•ha  is  the  set  of  abstraction  axioms,  each  of  the  form: 
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Vx  .  Ei(x)  3  E2(x) 

for  some  Ei,  E2  e  Hg.  In  this  case  we  say  that  E2  directly  abstracts  Ei.  The  transi¬ 
tive  closure  of  direct  abstraction  is  abstraction;  and  the  fact  that  E2  is  the  same  as  or  ab¬ 
stracts  Ei  is  written  E2  abstracts*  Ei.  AnyEvent  abstracts*  all  event  types. 

«Hfr  is  the  set  of  basic  event  type  predicates,  those  members  of  Hg  which  do  not  ab¬ 
stract  any  other  even,,  type. 

•hd  is  the  set  of  decomposition  axioms,  each  of  the  form: 

Vx  .  Eq(x)  d  Ej(fi(x))  a  E2(f2(x))  a  ...  a  En(fn(x))  a  k 

where  Eq,  ....  En  e  Hg;  fj, ....  fn  are  role  functions;  and  k  is  a  subformula  contain¬ 
ing  no  member  of  He-  The  formula  k  describes  the  constraints  on  Eo-  Ei  through  En 
are  called  direct  components  of  Eo-  Sometimes  we  refer  to  a  component  by  the  name  of 
its  role  function;  e.g.,  Ei  is  the  fi  direct-component  of  Eo.  The  type  End  never  appears 
as  a  direct  component  of  another  type;  nor  does  any  type  which  End  abstracts. 

•Hq  is  the  set  of  general  axioms,  those  which  do  not  contain  any  member  of  He-  Hq 
includes  the  axioms  for  the  temporal  interval  relations;  the  density  axioms  for  Holds 
and  Never,  the  property  identity  axioms;  as  well  as  any  other  facts  not  specifically  re¬ 
lating  to  events. 

2.4.  Components  of  Event  T okens 

The  component  relation  may  be  applied  to  event  tokens  in  a  model  M  as  fol¬ 
lows.  Suppose  :Q  and  :Co  are  event  tokens.  Then  :Q  is  a  direct  component  of  :Cq  in 
M  if  and  only  if 

(i)  there  are  event  types  Ei  and  Eq,  such  that  :Q  e  M[Ej]  and  :Cq  e  M[Eo] 

(ii)  Hq  contains  an  axiom  of  the  form: 

Vx  .  Eq(x)  3  Ei(fi(x))  a  ...  a  Ej(fj(x))  a  ...  a  En(fn(x))  A  K 

(iii)  :Ci=M[fi](:C0) 
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The  component  relation  is  the  transitive  closure  of  the  direct  component  relation,  and 
the  fact  that  :Cn  is  either  the  same  as  or  a  component  of  :Cq  is  written  :Cn  is  a  compo¬ 
nent*  of :  Co- 

Note  that  the  component  relation  over  event  tokens  does  not  correspond  to  the 
transitive  closure  of  the  direct-component  meta-relation  over  event  types.  This  is  due  to 
the  fact  that  a  token  may  be  of  more  than  one  type. 

2.5.  Acyclic  Hierarchies  &  Compatible  Types 

An  acyclic  hierarchy  is  one  that  can  be  exhaustively  searched  in  finite  time,  and 
is  formally  defined  as  follows.  Two  event  predicates  and  E2  are  compatible  if  there 
is  an  event  type  E^  such  that  both  E^  and  E2  abstract*  E^.  A  hierarchy  if  acyclic  if  it 
contains  no  series  of  event  predicates  Ej,  E2, ...,  Eq  such  that: 

(i)  E-  is  a  direct  component  of  E{+  \  for  odd  i,  1  <;  i  <;  n-1 

(ii)  Ej  is  compatible  with  Ej+i  for  even  j,  2  £  j  £  n-2 

(iii)  En  is  compatible  with  E1 

We  will  consider  only  acyclic  hierarchies  in  this  thesis,  although  most  results  should 
extend  to  cyclic  hierarchies  as  well. 

Roughly  speaking,  a  lexical  hierarchy  is  cyclic  if  an  event  token  may  have  an 
event  of  the  same  type  as  a  component  The  presence  of  the  abstraction  hierarchy, 
however,  makes  this  definition  too  generally  applicable.  All  events  are  of  type 
AnyEvent;  therefore  any  event  token  will  share  at  least  the  type  AnyEvent  with  its  com¬ 
ponents.  The  previous  definition  avoids  this  problem  by  consider  only  the  meta-struc¬ 
ture  between  event  type  predicates. 

2.6.  Example:  The  Cooking  World 

The  actions  involved  in  cooking  form  a  interesting  yet  tractable  domain  for 
planning  and  plan  recognition.  The  specialization  relations  between  various  kinds  of 
foods  are  mirrored  by  specialization  relations  between  the  actions  which  create  those 
foods.  Decompositions  are  associated  with  the  act  of  preparing  a  type  of  food,  in  the 


7A-D-43 


manner  in  which  a  recipe  spells  out  the  steps  in  the  food's  preparation.  A  good  cook 
stores  information  at  various  levels  in  his  or  her  abstraction  hierarchy.  For  example, 
one  knows  certain  actions  which  are  needed  to  create  any  cream-based  sauce,  as  well  as 
certain  conditions  (constraints)  which  must  hold  during  the  preparation.  The  sauce 
must  be  stirred  constantly,  the  heat  must  be  moderate,  and  so  on.  A  specialization  of 
the  type  cream-sauce,  such  an  Alfredo  sauce,  adds  steps  and  constraints:  e.g.,  one 
should  slowly  stir  in  grated  cheese  at  a  certain  point  in  the  recipe. 

The  cook  and  the  observer  have  the  same  knowledge  of  cooking,  a  hierarchi¬ 
cally-arranged  cookbook.  (Most  real  cookbooks  are  "flat",  of  course;  years  of  experi¬ 
ence  are  required  to  induce  the  hierarchical  structure.)  Actions  of  the  cook  are  reported 
to  the  observer,  who  tries  to  infer  what  the  cook  is  making.  We  do  not  assume  that  the 
reports  are  exhaustive  -  there  may  be  unobserved  actions  -  although  such  an 
assumption  could  be  made,  without  changing  our  framework:  we  would  simply  add 
additional  observational  reports,  of  the  form  "nothing  else  happened"  over  certain  time 
periods.  A  cook  may  prepare  several  different  dishes  at  the  same  time,  so  it  is  not 
always  possible  to  assume  that  all  observations  are  part  of  the  same  recipe.  Different 
End  events  may  share  steps.  For  example,  the  cook  may  prepare  a  large  batch  of 
tomato  sauce,  and  then  use  the  sauce  in  two  different  dishes. 

We  do  not  aim  to  capture  a  lot  of  domain-specific  information  about  cooking: 
rather,  cooking  appears  to  be  one  of  the  most  general  "toy  worlds"  one  can  consider. 
(For  interesting  work  on  planning  in  the  cooking  world,  see  [Schmolze  86].) 
Techniques  which  work  in  the  cooking  world  should  translate  to  almost  any  other  do¬ 
main. 


2.6.1.  Diagrammatic  Form 

The  following  diagram  illustrates  a  very  tiny  cooking  hierarchy.  Thick  grey  ar¬ 
rows  denote  the  abstraction  meta-relation,  while  thin  black  arrows  denote  the  direct 
component  meta-relation.  All  event  types  are  abstracted  by  AnyEvent.  As  discussed 
above.  Ends  are  a  special  kind  of  event,  which  are  not  components  of  any  other  event. 
Here  there  are  two  main  categories  of  End  events:  preparing  meals  and  washing  dish¬ 
es.  It  is  important  to  understand  that  the  abstraction  hierarchy,  encoded  by  the  axioms 
in  H^,  and  the  decomposition  hierarchy,  encoded  by  the  axioms  in  Hq,  are  interrelated 
but  separate.  Work  on  hierarchical  planning  often  confuses  these  two  distinct  notions 
in  an  action  or  event  hierarchy. 
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Any 


figure  2.1:  Cooking  Hierarchy 


2.6.2.  The  Abstraction  Hierarchy 

The  diagram  suggests  some  of  the  elements  which  make  up  the  lexical  hierarchy. 

•  The  set  of  event  types,  Hg,  includes  PrepareMeal,  MakeNoodles,  MakeFettucini, 
and  so  on. 

•  The  abstraction  axioms,  H^,  assert  that  every  MakeSpaghetti  and  every 
MakeFettucini  is  also  a  MakeNoodles,  that  every  MakePastaDish  is  also  a  PrepareMeal, 
and  so  on.  A  traditional  planning  system  might  call  MakeSpaghetti  and  MakeFettucini 
different  bodies  of  the  MakeNoodles  plan. 
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Vx  .  MakeSpaghctti(x)  r>  MakeNoodles(x) 

Vx  .'MakeFettucini(x)  z>  MakeNoodles(x) 

Vx .  MakePastaDish(x)  3  PrepareMeal(x) 

•The  basic  event  types,  Hgg,  appear  at  the  bottom  of  the  abstraction  (grey)  hierarchy. 
These  include  the  types  WashDishes,  Boil,  MakeSpaghettiMarinara, 
MakeChickenMarinara,  MakeFettucini,  MakeSpaghetti,  and  MakeMarinara.  Note  that 
basic  event  types  may  have  components  (but  no  specializations). 

2.6.3.  The  Decomposition  Hierarchy 

•The  decomposition  axioms,  Hp,  include  much  information  which  does  not  appear  in 
the  diagram.  These  axioms  specify  the  role-functions  which  link  an  event  to  its 
components,  and  the  constraints  which  hold  between  those  steps  and  the  event. 
Following  is  (an  abbreviated  version  of)  the  decomposition  axiom  for  the 
MakePastaDish  event  This  act  includes  at  least  three  steps:  making  noodles,  making 
sauce,  and  boiling  the  the  noodles.  The  equality  constraints  assert  among  other  things, 
that  the  agent  of  each  step  is  the  same  as  the  agent  of  the  overall  act;  and  that  the 
noodles  the  agent  makes  (specified  by  the  result  role  function  applied  to  the 
MakeNoodles  step)  are  the  thing  boiled  (specified  by  the  input  role  function  applied  to 
the  Boil  step).  Temporal  constraints  explicitly  state  the  temporal  relations  between  the 
steps  and  the  MakePastaDish.  For  example,  the  time  of  each  step  is  during  the  time  of 
the  MakePastaDish,  and  the  Boil  must  follow  the  MakeNoodles.  The  constraints  in  the 
decomposition  include  the  preconditions  and  effects  of  the  events.  Preconditions  for 
MakePastaDish  include  that  the  agent  is  in  the  kitchen  during  the  event,  and  that  the 
agent  is  dexterous  (making  pasta  by  hand  is  no  mean  feat!).  An  effect  of  the  event  is 
that  there  exists  something  which  is  a  PastaDish,  the  result  of  the  event,  which  is 
ready  to  eat  during  a  time  period  postTime,  which  immediately  follows  the  time  of  the 
cooking  event 
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Vx .  MakePastaDish(x)  d 

MakeNoodles(stepl(x))  a 
MakeSauce(step2(x))  a 
Boil(step3(x))  a 
agent(stepl(x))  =  agent(x)  a 
result(stepl(x))  =  input(step3(x))  a 
During(time(stepi(x)'>,  time(x))  a 
BeforeMeets(time(stepl(x)),  time(step3(x)))  a 
Overlaps(  time(x),  postTime(x))  a 
Holds(inKitchen(agent(x)),  time(x))  a 
Dexterous(agent(x))  a 

Holds(readyToEat(result(x)),  postTime(x))  a 
PastaDish(result(x)) 

Note  that  the  names  of  the  component  roles,  stepl,  step2,  etc.,  are  arbitrary;  they  do 
not  indicate  temporal  ordering.  The  event  types  which  specialize  MakePastaDish  add 
additional  constraints  and  steps  to  its  decomposition.  For  example,  the  event  type 
MakeSpaghettiMarinara  further  constrains  its  decomposition  to  include  MakeSpaghetti 
(rather  than  the  more  generic  MakeNoodles)  and  MakeMarinaraSauce  (rather  than  sim¬ 
ply  MakeSauce).  One  could  also  add  completely  new  steps  as  well. 

Vx .  MakeSpaghettiMarinara(x)  o 

MakeSpaghetti(stepl(x))  a 
MakeMarinaraSauce(step2(x))  a  . . . 

2.6.4.  Describing  Instances  of  Events 

Assertions  about  particular  event  instances  take  the  form  of  the  predication  of  an 
event  type  of  a  constant,  conjoined  with  equality  assertions  about  the  roles  of  the  event 
token,  and  perhaps  a  proposition  relating  the  time  of  the  event  to  that  of  other  events. 
The  English  statement,  "Joe  made  the  noodles  on  the  table  yesterday"  may  be  repre¬ 
sented  as  follows: 

MakeNoodle(Make33)  a 
agent(Make33)  =  Joe  a 
result(Make33)  =  Noodles72  a 

Holds(onTable(Noodles72),  Tnow)  a 
During(  time(Make33),  Tyesterday ) 


Components 

Equality 

Constraints 

Temporal 

Constraints 

Preconditions 

Effects 
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2.7.  Conditional  Actions 


Plans  of  action  often  contain  conditional  actions.  If  some  condition  holds,  then 
the  plan  (or  event  type)  contains  a  certain  step;  otherwise,  it  contains  a  different  step. 
At  first  glance,  the  form  of  a  lexical  hierarchy  does  not  seem  to  allow  for  the  presence 
of  conditional  steps  in  the  decomposition  of  an  event  type.  Consider  the  case  where  if  a 
condition  P  holds,  event  type  EO  should  contain  a  step  of  type  E-p;  otherwise,  it  con¬ 
tains  one  of  type  Ep.  One  may  want  to  write  something  like  the  following. 

Vx .  E0(x)  3  (P  d  (Et(st(x))  a  kt))  a 
HP  =3  (Ep(sp(x))  A  Kp))  A 
E2(s2(x))  a  ...  a  k 

The  expressions  icp  and  Kp  introduce  constraints  which  only  hold  on  the  conditional 
part  of  the  event  This  formula  is  clearly  not  of  the  form  specified  for  axioms  in  H]> 
Does  this  mean  that  all  the  work  in  this  thesis  that  depends  on  lexical  hierarchies  must 
be  redone  in  order  to  account  for  conditional  events? 

2.7.1.  Representing  Conditional  Actions 

Fortunately,  the  answer  is  no.  The  form  of  a  lexical  hierarchy  is  sufficiently 
general  to  accommodate  conditional  events,  through  the  introduction  of  additional  ab¬ 
stract  event  types.  Consider  the  case  above.  There  are  two  clear  ways  of  specializing 
the  event  type  EO.  In  the  first  category,  P  holds,  and  EO  contains  the  component  Ep. 
In  the  second  category,  P  does  not  hold,  and  EO  contains  the  component  Ep.  This 
situation  is  captured  by  the  following  axioms. 

Vx  .  E0(x)  3  E2(s2(x))  a  ...  a  k 

Vx.E0t(x)=>  E0(x) 

Vx  .  EOp(x)  3  EO(x) 

Vx  .  EOp(x)  3  Et(sj(x))  a  kj  a  P 
Vx  .  EOp(x)  3  Ep(sp(x))  A  Kp  A  -iP 

It  is  plain  that  these  axioms  are  of  the  proper  form  to  appear  in  Ha  and  H^.  The  pro¬ 
cess  that  transformed  the  first  statement  into  the  second  set  of  statements  could  be  re- 
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peated,  in  case  EO  contains  more  than  one  conditional  expression.  In  fact,  this  forms  a 
proof  that  conditional  statements  can  always  be  "factored  out"  of  decomposition  ax¬ 
ioms. 


This  is  not  the  only  possible  way  of  representing  conditionals  in  a  lexical  hier¬ 
archy.  Another  method  would  be  to  replace  the  conditional  expression  in  the 
decomposition  of  EO  by  a  new  event  type,  and  then  give  two  specializations  for  that 
event  type.  The  result  could  be  as  follows. 

Vx .  E0(x)  o  Ep(sl(x))  a  Kp  a 
E2(s2(x))  a  ...  A  K 

Vx.Epr(x)o  Ep(x) 

Vx  .  Epp(x)  3  Ep(x) 

Vx  .  Epr(x)  3  Et(st(x))  a  Kp  a  P 

Vx  .  Epp(x)  3  Ep(sp(X»  AKfA-iP 

The  new  constraint  expression  Kp  in  the  first  axiom  makes  sl(x)  have  all  the  same  roles 
as  x,  except  for  the  new  role  si  itself.  This  is  necessary  so  the  constraint  expressions 
Kp  and  Kp  in  the  final  two  axioms  properly  constrain  sp(x)  and  sp(x).  The  advantage 
of  this  method  is  that  it  results  in  fewer  axioms  if  the  initial  decomposition  axiom  con¬ 
tains  many  conditional  expressions.  Where  n  is  the  number  of  conditionals,  the  former 
method  could  introduce  as  many  as  0(2n)  axioms,  while  the  latter  method  only  intro¬ 
duces  0(n)  axioms. 

Some  question  might  arise  in  the  case  where  half  the  conditional  is  empty. 
Suppose  that  a  certain  action  had  to  be  performed  only  if  P  held: 

Vx  .  E0(x)  o  (Po  (Ep(sp(x))  a  Kp))  a 
E2(s2(x))  a  ...  a  k 

Either  transformation  goes  through  as  before,  with  the  elimination  of  the  atom 
Ep(sp(x)).  One  of  the  axioms  generated  by  the  second  method,  however,  looks  a  bit 
odd.  The  decomposition  axiom  for  the  event  Epp  contains  no  event- types  in  its  con¬ 
sequence: 
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Vx  .  EppOO  3  -iP 


What  is  intuitive  meaning  of  the  event  type  Epp?  It  is  a  kind  of  "null  event".  One  will 
never  directly  observe  an  instance  of  Epp;  however,  from  3x.E0(x)  and  ->P  one  can 
deduce  that  Epp  occurred.  Unlike  some  accounts  of  null  actions,  we  do  not  insist  that 
a  null  action  occurs  over  all  time  periods  in  which  the  world  is  unchanged. 

2.7.2.  An  Example 

The  PickUp  action  in  the  cooking  world  depends  critically  upon  the  temperature 
of  the  object  being  picked  up.  If  the  object  is  hot,  the  agent  must  wear  a  mit  to  avoid 
being  burned.  Therefore  the  plan  for  picking  up  an  object  must  begin  with  the  action  of 
conditionally  putting  on  a  mit  if  the  object  is  hot  This  can  be  encoded  in  an  event  hier¬ 
archy  as  follows. 

Vx .  PickUp(x)  3 

Grasp(s2(x))  a  object(x)»object(s2(x))  a  ... 

Vx  .  PickUpHot(x)  3  PickUp(x) 

Vx  .  PickUpCool(x)  3  PickUp(x) 

Vx .  PickUpHot(x)  3 

Holds(Hot(object(x)),  time(x))  a 

PutOnMit(si(x))  a  BeforeMeet(time(sl(x)),time(s2(x)))  a  ... 

Vx .  PickUpCool(x)  3 

Never(Hot(object(x)),  time(x)) 

Every  PickUp  includes  grasping  the  object,  lifting  it,  and  so  on.  PickUps  are  either  of 
hot  or  cool  objects.  Picking  up  a  hot  object  requires  a  preliminary  step  of  putting  on  a 
mit  Picking  up  a  cool  object  is  constrained  to  occur  only  when  the  object  is  never  hot 
during  the  time  of  the  PickUp. 
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Chapter  3 
Covering  Models 

We  have  seen  that  there  are  too  many  models  of  an  event  hierarchy  to  construct 
a  semantic  basis  for  recognition.  A  technique  known  as  model  minimization  can  be 
used  to  select  a  suitable  subset,  called  covering  models.  In  a  covering  model,  any  non- 
End  event  is  a  component  of  some  End  event  Each  covering  model  for  an  observation 
serves  as  an  explanation,  in  terms  of  End  events,  of  the  observation.  While  it  would  be 
unwise  to  arbitrarily  adopt  a  particular  covering  model,  it  is  reasonable  to  conclude 
whatever  propositions  hold  in  all  covering  models.  These  propositions  are  c-entailed 
by  the  observation.  The  sequence  of  model  minimizations  used  to  construct  the  cover¬ 
ing  models  corresponds  to  a  complex  application  of  McCarthy's  predicate  circumscrip¬ 
tion  schema.  An  easily-computed  set  of  completeness  assumptions  provides  a  complete 
proof-theoretic  description  of  the  covering  models. 

Recall  that  a  lexical  hierarchy  H  contains  two  major  parts:  an  abstraction 
hierarchy,  H^,  and  a  decomposition  hierarchy,  Hp).  Each  of  these  hierarchies  must  be 
strengthened  in  order  to  be  used  for  recognition.  The  abstraction  hierarchy  is 
strengthened  by  assuming  that  there  are  no  event  types  outside  of  He,  and  that  all  ab¬ 
straction  relations  between  event  predicates  are  derivable  from  H^.  The  decomposition 
hierarchy  is  strengthened  by  assuming  that  non-End  events  occur  only  as  components 
of  other  events. 

These  assumptions  are  reasonable  because  H  encodes  all  of  our  knowledge  of 
events.  If  the  hierarchy  is  enlarged,  the  assumptions  must  be  revised.  (This  thesis 
does  not  deal  with  learning,  but  may  be  compatible  with  various  learning  strategies.  An 
obvious  approach  is  to  try  to  explain  the  observations  while  assuming  H  is  complete;  if 
there  are  no  covering  models  of  small  cardinality  in  End,  then  relax  some  of  the  com¬ 
pleteness  assumptions.)  At  the  conceptual  level  one  can  imagine  computing  all  the 
completeness  assumptions  each  time  the  hierarchy  is  modified.  An  implementation,  of 
course,  could  incrementally  build  structures  representing  the  assumptions  as  each  part 
of  the  hierarchy  is  created. 

3 . 1  Model  Minimization 
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Let  Mj  be  a  member  of  a  class  of  models  p,  and  let  tc  be  a  set  of  predicates.  Mi  is 
minimal  in  %  among  p.  if  and  only  if  there  does  not  exist  any  other  model  M2  such 
that: 


1.  M2  is  a  member  of  \L 

2.  Mi  and  M2  have  the  same  domain. 

3.  Mi  and  M2  agree  on  the  interpretation  of  all  constants,  functions, 

and  predicates  not  in  tc. 

3.  The  extension  of  every  member  of  tc  in  M2  is  a  subset  of  the  exten¬ 
sion  of  that  predicate  in  M^ 

5.  The  extension  of  some  member  of  tc  in  M2  is  a  proper  subset  of  the 
extension  of  that  predicate  in  M^. 

If  Mi  fails  to  be  minimal  because  of  such  an  M2,  we  say  that  M2  defeats  the  candi¬ 
dacy  of  Mi. 

3.2  Completing  the  Abstraction  Hierarchy 

The  following  conditions  incrementally  define  the  A-closed  models  of  H.  In  each 
case,  M  is  a  model  of  Ha- 

•  M  is  closed  under  specialization  if  M  is  minimal  in  H£-H£b 

among  models  of  Ha-  That  is,  all  non-basic  event  types  are 
minimized. 

•  M  is  closed  under  abstraction  if  M  is  minimal  in 

H£-{  Any  Event}  among  models  of  Ha  which  are  closed  under 
specialization. 

•  Finally,  we  define  M  to  be  an  A-closed  model  of  H  just  in  case  M  is 

a  model  of  H,  and  M  is  also  a  model  of  Ha  which  is  closed  un¬ 
der  abstraction. 

The  following  theorems  describe  the  A-closed  models  in  proof-theoretic  terms. 
Proofs  appear  in  the  Appendix.  Theorem  3.1  says  that  the  given  specializations  of  each 
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type  are  assumed  to  be  exhaustive.  Theorem  3.5  shows  that  we  have  assumed  that  all 
event  types  are  disjoint,  unless  H  explicitly  states  otherwise.  (Disjointedness  assump¬ 
tions  are  used  heavily  in  recognition,  as  demonstrated  below.  We  do  not  assert  that 
different  event  types  cannot  occur  simultaneously;  only  that  a  particular  token  cannot  be 
of  two  non-compatible  types.)  Theorem  3.7  says  that  every  event  is  of  exactly  one  ba¬ 
sic  type.  Theorem  3.9  presents  a  complete  axiomatization  of  the  A-closed  models. 

3.2.1.  Theorem  3.1  (Exhaustiveness) 

Suppose  (Ej,  E2 . Eq}  are  all  the  predicates  directly  abstracted  by  Eo  in  Ha-  Then 

the  statement 

Vx  .  Eo(x)  3  (Ei(x)  v  E2(x)  v  ...  v  En(x)) 

is  true  in  all  models  of  Ha  which  are  closed  under  specialization.  The  statement  is  also 
true  in  all  A-closed  models  of  H. 

3.2.2.  Theorem  3.5  (Disjointedness) 

If  event  predicates  Ei  and  E2  are  not  compatible,  then  the  statement 

Vx .  -iEi(x)  v  -ds2(x) 

is  true  in  all  models  of  Ha  which  are  closed  under  abstraction.  The  statement  is  also 
true  in  all  A-closed  models  of  H. 

3.2.3.  Theorem  3.7  (Unique  Basic  Types) 

If  Mi  is  a  model  of  Ha  closed  under  abstraction  containing  event  token  :Ci,  then  there 
is  a  unique  basic  event  type  Efc  such  that  :C  €  Mi[E|jJ.  Any  event  type  which  holds  of 
:C  abstracts41  E5. 

3.2.4.  Theorem  3.9  (Abstraction  Completeness) 

Let  EXA  be  the  set  of  all  statements  which  instantiate  Theorem  3.1,  and  let  DJA  be  the 
set  of  all  statements  which  instantiate  Theorem  3.5  for  a  particular  H.  Mi  is  an  A- 
ciosed  model  of  H  if  and  only  if  Mi  is  a  model  of  H  u  EXA  u  DJA. 


7A-D-  53 


3.3  Completing  the  Decomposition  Hierarchy 

Once  the  abstraction  hierarchy  has  been  closed,  it  is  a  simple  matter  to  close  the 
decomposition  hierarchy,  by  minimizing  the  set  of  non-End  event  types.  C-entailment 
is  then  defined  in  terms  of  the  covering  models.  Theorem  3.10  says  that  every  non- 
End  event  must  be  a  component  of  some  other  event,  and  Theorem  3.11  allows  one  to 
infer  the  disjunction  of  possible  uses  of  observed  event  token.  By  Theorem  3. 13  every 
event  is  part  of  an  End  event,  and  by  3.14  the  upward-inference  assumptions  exacdy 
axiomatize  the  covering  models.  Theorem  3.15  states  the  obvious  corollary  that  c-en¬ 
tailment  is  computable.  Theorem  3.16  shows  that  one  must  consider  all  the  possible 
abstractions  and  specializations  of  an  event  in  order  to  account  for  all  of  its  possible 
uses  —  and  therefore  predicate  completion  in  the  style  of  [Clark  78]  does  not  corre¬ 
spond  to  decomposition  completion. 

3.3.1.  Definition  of  Covering  Model  and  C-Entailment 

M  is  a  covering  model  of  H  if  M  is  minimal  in  Hg-{End}  among  A-closed  models 
of  H.  Then  T  c -entails  Q,  written 


when  £1  holds  in  all  covering  models  of  H  in  which  T  holds.  If  H  holds  for  any  I\  Q 
is  c-valid. 


3.3.2.  Theorem  3.10  (No  Useless  Events) 

Let  be  a  covering  model  of  H,  containing  event  token  :Cj.  Then  either 
:Ci  €  Mi  [End]  is  true,  or  there  exists  some  event  token  :C2  such  that  :Ci  is  a  direct 
component  of  :C2- 

3.3.3.  Theorem  3.11  (Component/Use) 

Let  E  €  HE,  and  Com(E)  be  the  set  of  event  predicates  with  which  E  is  compatible. 
Consider  all  the  decomposition  axioms  in  which  any  element  of  Com(E)  appears  on  the 
right-hand  side.  The  j-th  such  decomposition  axiom  has  the  following  form,  where  Eii 
is  the  element  of  Com(E): 
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Vx  .  Ejq(x)  o  Hj i (fj i(x))  a  ...  a  Eji(fji(x))  a  ...  a  Ejn(fjn(x))  a  ic 

Suppose  that  the  series  of  these  axioms,  where  an  axiom  is  repeated  as  many  times  as 
there  are  members  of  Com(E)  in  its  right-hand  side,  is  of  length  m  >  0.  Then  the  fol¬ 
lowing  statement  is  c-valid: 

Vx  .  E(x)  3  End(x)  v 

(3y  •  Ei,o(y) A  fii(y)*x)  v 
(3y  •  E2,o(y)  a  f2i(y)»x)  v 

...  V 

(3y  •  Em,0(y)  A  fmi(y)=x) 

3.3.4.  Theorem  3.13  (No  Infinite  Chains) 

If  Mi  is  a  covering  model  of  H  such  that  Ci  e  Mi[E],  then  there  is  a  :Cn  such  that 
:Cn  €  Mi  [End]  and  :Ci  is  a  component*  of  :Cn. 

3.3.5.  Theorem  3.14  (Decomposition  Completeness) 

Let  CUA  be  the  set  of  all  formulas  which  instantiate  Theorem  3.11  for  a  particular  H. 
M  i  is  a  covering  model  of  H  if  and  only  if  Mi  is  a  model  of 
H  u  EXA  u  DJA  u  CUA. 

3.3.6.  Theorem  3.15  (Computability  of  C-Entailment) 

There  is  a  computable  function  ci  which  maps  a  hierarchy  H  into  a  set  of  axioms  with 
the  property  that 


if  and  only  if 


rH 


ft 


ci(H)  u  r  h  a 

3.3.7.  Theorem  3.16  (Not  Predicate  Completion) 
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Theorem  3.11  cannot  be  strengthened  by  considering  only  axioms  in  which  E  appears 
as  a  component,  instead  of  all  ones  in  which  event  types  compatible  with  E  appear  as 
components. 

3.4  Circumscription 

Predicate  circumscription  [McCarthy  84]  provides  a  proof  theoretic  realization 
of  the  model-theoretic  minimalization  operation  used  above.  Direct  use  of  the  circum¬ 
scription  schema  is  difficult,  however.  Its  most  general  form  is  a  second-order,  rather 
than  a  first-order  statement  Techniques  are  known  for  automatically  computing  first- 
order  circumscriptions  for  certain  kinds  of  axiom  sets;  for  example,  for  horn-clauses 
[Bossu  &  Seigel  85],  or  "separable"  databases  [Lifschitz  84],  None  of  these  previ¬ 
ously  known  techniques  apply  in  the  case  under  consideration  here.  Although  the 
original  hierarchy  may  be  in  horn-clauses,  the  result  of  the  first  minimization  is  a  non- 
hom  set  of  sentences,  so  the  techniques  based  on  predicate-completion  are  not  applica¬ 
ble.  None  of  the  minimizations  involve  separable  sets  of  predicates.  However,  the  re¬ 
stricted  form  of  lexical  hierarchies  has  allowed  us  to  directly  compute  a  set  of  first-order 
statements  which  characterize  all  models  resulting  from  a  certain  sequence  of  minimiza¬ 
tions.  This  work  thus  describes  a  special  but  useful  case  in  which  circumscription  can 
be  efficiently  computed. 

Recall  that  the  circumscription  of  the  set  of  predicates  it  over  a  formula  S, 
written  Circum(S[7c],JC),  stands  for  the  second-order  formula 

S[ic]  a  V  o .  (S[a]  Ao£it)3it£a 

where  the  expression  a  £  it  abbreviates  the  formula  stating  that  the  extension  of  each 
predicate  in  a  is  a  subset  of  the  extension  of  the  corresponding  predicate  in  it;  that  is: 

(Vx  .  <Ji(x)  3  itj(x))  a  ...  a  (Vx  .  crn(x)  3  itn(x)) 

3.4.1.  Theorem  3.17  (C*entailment  and  Circumscription) 

For  a  given  a  hierarchy  H,  a  statement  A  is  c-entailed  by  T  if  and  only  if  Q  follows 
from  the  following  schema: 
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r  a  Circum(  H  a  Circum(  Circum(  H^. ,  Hg-Hgg  ), 

{Any  Event}), 

Hg-{End}) 

Note  that  the  two  inner  circumscriptions  apply  only  to  the  abstraction  hierarchy,  while 
the  final  circumscription  applies  to  all  of  H. 

3.5  Example:  The  Cooking  World,  Continued 

We  return  to  the  domain  of  the  cooking,  and  discuss  some  of  the  statements 
which  appear  in  the  closure  of  the  lexical  hierarchy.  These  statements  are  then  used  to 
solve  a  simple  plan  recognition  problem.  The  closure  function,  cl,  generates  three  sets 
of  axioms:  the  exhaustiveness  assumptions  (EXA),  disjointedness  assumptions  (DJA), 
and  component/use  assumptions  (CUA).  We  consider  each  set  in  turn. 

3.5.1.  Exhaustiveness  Assumptions  (EXA) 

These  axioms  arise  by  minimizing  non-basic  event  types.  They  justify  infer¬ 
ences  from  an  abstract  type  to  the  disjunctive  of  its  specializations.  In  the  cooking 
world,  EXA  includes  the  assertion  that  ail  End  events  are  either  instances  of  preparing 
meals  or  washing  dishes: 

Vx .  End(x)  o 

PrepareMeal(x)  v 
CleanHouse(x) 

Similarly,  all  instances  of  preparing  meals  are  either  instances  of  making  a  pasta  dish  or 
making  a  meat  dish. 

Vx .  PrepareMeal(x)  z> 

MakePastaDish(x)  v 
MakeMeatDish(x) 

One  such  axiom  appears  for  every  event  type  not  in  Hfr. 

3.5.2.  Disjointedness  Assumptions  (DJA) 

These  axioms  arise  by  minimizing  all  event  types  other  than  AnyEvent.  This 
minimizes  the  set  of  types  of  each  event  token.  By  EXA,  every  event  is  of  some  basic 
type;  by  DJA,  it  is  of  no  more  than  one  basic  type.  The  final  effect  oi  this  is  that  types 
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are  disjoint,  unless  one  abstracts  the  other,  or  they  abstract  a  common  type.  It  is  im¬ 
portant  to  reiterate  that  the  assumptions  do  not  assert  that  different  event  types  cannot 
occur  simultaneously;  only  that  a  particular  token  cannot  be  of  two  non-compatible 
types.  The  cooking  world  example  includes  the  assumptions  that  preparing  a  meal  and 
cleaning  a  house  are  disjoint;  making  a  pasta  dish  and  making  a  meat  dish  are  disjoint; 
and  so  on. 


Vx .  — sPrepareMeal(x)  v  -iCleanHouse(x) 

Vx .  -iMakePastaDish(x)  v  -iMakeMeatDish(x) 

It  is  simply  to  "block"  a  potential  disjointedness  assumption  by  adding  a  new 
type.  For  example,  suppose  we  do  not  want  to  make  the  assumption  that  pasta  dishes 
and  meat  dishes  are  disjoint  Then  add  a  type  to  the  original  hierarchy  which  is  ab¬ 
stracted  by  both;  for  example,  MakeMeatRavioli. 

adding  to  H& 

Vx .  MakeMeatRavioli(x)  3  MakePastaDish(x) 

Vx .  MakeMeatRavioli(x)  3  MakeMeatDish(x) 

would  eliminate  from  DJA 

Vx  .  -iMakePastaDish(x)  v  -iMakeMeatDish(x) 

3.5.3.  Component/Use  Assumptions  (CUA) 

The  most  important  assumptions  for  recognition  arise  from  minimizing  non-End 
events.  The  assumptions  in  CUA  let  one  infer  the  disjunction  of  the  possible  causes  for 
an  event  from  its  occurrence.  The  axioms  take  us  from  an  event  to  an  event  which  has 
a  compatible  type  as  a  component  The  simplest  case  is  when  only  a  single  type  could 
have  a  particular  event  as  a  direct  component  For  instance,  the  hierarchy  has  only  a 
single  use  for  the  action  Boil,  namely  MakePastaDish. 

V  x  .  Boil(x)  3 

3  y  .  MakePastaDish(y)  a  x  =  step3(y) 
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It  is  frequently  possible  to  simplify  the  statements  in  CUA,  by  taking  advantage 
of  the  abstraction  axioms.  For  example,  according  the  rule  for  construction  of  CUA, 
the  upward-inference  axiom  for  MakeNoodle  is: 

V  x  .  MakeNoodle(x)  3 

(3  y  .  MakePastaDish(y)  a  x  =  stepl(y))  v 
(3  y  .  MakeSpaghettiMarina(y)  a  x  =  stepl(y))  v 
(3  y  .  MakeSpaghettiPesto(y)  a  x  =  stepl(y))  v 
(3  y  .  MakeFettuciniAlfredo(y)  a  x  =  stepl(y)) 

The  following  axiom  is  equivalent,  in  light  of  H^. 

V  x  .  MakeNoodle(x)  3 

3  y  .  MakePastaDish(y)  a  Ml  *  stepl(y) 

The  simplification  could  not  be  made  if  MakePastaDish  did  not  abstract  all  of  the  other 
event  types,  MakeSpaghettiMarina,  MakeSpaghetdPesto,  and  MakeFettuciiuAlfredo. 

Another  axiom  that  will  be  useful  in  our  examples  lets  one  infer  the  two  known 
uses  for  making  marinara  sauce,  namely  making  spaghetti  marinara  and  making  chicken 

marinara 


V  x  .  MakeMarinara(x)  3 

(3  y  .  MakcSpaghettiMaiinara(y)  ax*  stepl(y))  v 
(3  y  .  MakeChickenMarinara(y)  a  x  =  step3(y)) 

This  statement  has  also  been  simplified.  The  original  form  should  include  the  disjunct 
MakePastaDish,  since  MakePastaDish  has  the  direct  component  MakeSauce,  which  is 
compatible  with  MakeMarinara.  However,  the  exhaustiveness  assumptions  can  be 
used  to  conclude  that  any  MakePastaDish  is  either  MakeSpaghettiMarinara, 
MakeSpaghettiPesto,  or  MakeFettuciniAlfxedo.  The  stepl  role  in  the  latter  two  cases 
must  be  filled  by  an  event  of  type  MakePesto  or  MakeAlfredo  respectively.  The  dis¬ 
jointedness  assumptions  can  then  be  used  to  infer  that  MakeMarinaraSauce, 
MakePe:  to,  and  MakeAlfredo  are  mutually  disjoint.  Therefore  the  possibility  that 
MakeMarinaraSauce  could  be  a  component  of  MakeSpaghettiPesto  or 
MakeFettuciniAlfredo  can  be  eliminated.  We  will  use  without  further  comment  simpli¬ 
fied  versions  of  tl  :  assumptions  in  CUA. 
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Finally,  the  UPA  axiom  for  MakeSauce  demonstrates  the  importance  of  consid¬ 
ering  compatible  types.  The  only  direct  use  of  MakeSauce  is  MakePastaDish;  however, 
MakeSauce  abstracts  MakeMaxinara,  and  that  event  type  appears  in  the  decomposition 
of  MakeChickenMarinara.  Therefore  the  axiom  is: 

V  x .  MakeSauce(x)  3 

(3  y  .  MakePastaDish(y)  a  x  ■  stepl(y))  v 
(3  y  .  MakeChickenMarinara (y)  a  x  =  step5(y)) 

3.5.4.  A  Simple  Recognition  Problem 

The  assumptions  justified  by  c-entailment  can  be  used  to  solve  simple  recogni¬ 
tion  problems.  Let  us  suppose  that  the  observer  learns  that  the  cook  is  either  making 
spaghetti  or  fettucini.  This  is  not  enough  information  to  conclude  a  particular  basic  dish 
is  being  created.  It  is  safe  to  conclude,  however,  that  some  pasta  dish  is  in  the  works. 
From  this  abstract  description  of  the  End  event  in  progress,  the  observer  can  still  make 
useful  predictions.  For  example,  he  knows  that  the  agent  will  eventually  begin  to  boil 
water.  A  helpful  observer  might  fetch  a  pot  of  water,  a  enemy  observer  might  shut  off 
the  water  supply ! 

Following  is  a  proof  sketch.  The  justification  for  each  step  appears  in  italics. 
Observation 

MakeFettucini(Ml)  v  MakeSpaghetti(Ml) 


Abstraction 

MakeNoodle(Ml) 


Component/use,  Existential  Instantiation 
MakePastaDish(*I  1) 


Abstraction 

End(*Il) 


Decomposition 

Boil(  step3(*Il) )  a  After(time(Ml),  dme(step3(*Il)) ) 
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The  final  step  is  the  predication,  that  a  boiling  event  will  occur  at  some  time  after  the 
observation.  As  discussed  in  Chapter  2,  the  final  logical  form  should  replace  the  indi¬ 
vidual  parameter  *11  by  an  existentially-quantified  variable,  and  write 

3y .  Boil(  step3(y) )  a  After(time(Ml),  time(step3(y)> ) 


Chapter  4 

Minimum  Covering  Models 

C-entailment  does  not  combine  information  from  several  observations.  We 
would  like  to  interact  possible  explanations  for  each  event  This  is  done  by  selecting 
covering  models  which  minimize  the  number  of  end  events.  Minimization  can  be  un¬ 
derstood  either  as  producing  a  set  with  as  few  elements  as  possible,  or  as  producing  a 
set  with  no  redundant  elements.  The  former  sense  is  called  cardinality  minimization, 
and  the  latter,  set  minimization.  (The  use  of  the  word  minimization  without  qualifica¬ 
tion  will  always  mean  set  minimization.)  The  model  minimization  schemes  used  in 
Chapter  3  are  all  instances  of  set  minimization.  The  work  in  this  chapter  relies  on  car¬ 
dinality  minimization.  We  will  show  that  certain  cases  of  circumscription  with  vari¬ 
ables  correspond  to  numeric  minimization. 

4.1.  Cardinality  Minimization 

Let  Mi  be  a  member  of  a  class  of  models  n,  and  let  n  be  a  predicate.  Mi  has  mini¬ 
mum  cardinality  in  x  among  Ji  if  and  only  if  there  does  not  exist  any  other  model 
M2  such  that: 

1.  M2  is  a  member  of  il 

2.  The  size  of  the  extension  of  Jt  in  M2  is  smaller  than  the  size  of  the 

extension  oftc  in  Mi.  That  is, 

I  M2(Jt]  I  <  I  Mi[x]  | 

If  Mi  fails  to  be  minimal  because  of  such  an  M2,  we  say  that  M2  defeats  the  candi¬ 
dacy  of  Mi. 

Let  T  be  any  sentence  or  set  of  sentences,  and  H  a  hierarchy.  M  is  a  minimum  cover 
of  T  (relative  to  H)  just  in  case 

1.  Mis  a  model  of  T 

2.  M  is  a  covering  model  of  H 

3.  M  has  minimum  cardinality  in  End  among  covering  models  of  H 
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Suppose  T  is  any  sentence.  Then  P  is  mc-entailed  by  T,  written 

^  H  ^mc  ^ 

if  Cl  holds  in  all  minimum  covers  of  T. 

The  proof-theoretic  counterpart  of  cardinality  minimization  is  to  adopt  the 
strongest  statement  which  limits  the  number  of  distinct  end  events.  Unfortunately,  this 
step  is  not  always  effectively  computable  (the  usual  problem  with  default  reasoning).  In 
practice,  one  makes  the  strongest  assumption  possible;  forward  chains  a  limited 
amount;  if  a  contradiction  is  detected,  then  makes  the  next  weaker  assumption,  and  re¬ 
peats. 


4.1.1.  Theorem  4.1  (Minimum  Cardinality  Defaults) 

Consider  the  following  sequences  of  statements. 


MAq. 

Vx .  -iEnd(x) 

MAi. 

Vx,y  End(x)  a  End(y)  3  x=y 

MA2- 

Vx,y,z  .  End(x)  a  End(y)  a  End(z) 

=>•  (x=y)  v  (x=z)  v  (y=z) 

The  first  asserts  that  no  End  events  exist;  the  second,  no  more  than  one  End  event  ex¬ 
ists;  the  third,  no  more  than  two;  and  so  on.  Suppose  there  is  a  minimum  covering 
model  in  which  the  extension  of  End  is  finite.  Then 

F  H  ^mc  ^ 

if  and  only  if 

T  u  cl(H)  u  MAi  h  G 

where  i  is  the  smallest  integer  such  that  left-hand  side  of  the  provability  relation  is  con- 

sistent 
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4.2.  Cardinality  Circumscription 

At  first  glance  it  would  seem  that  the  theory  of  circumscription  cannot  handle 
the  problem  of  minimizing  the  cardinality  of  the  extension  of  a  predicate.  The  circum¬ 
scription  schema  states  that  there  is  no  predicate  which  satisfies  the  axioms  for  the 
minimized  predicate  which  has  an  extension  which  is  a  proper  subset  of  that  of  the 
predicate.  The  extension  must  be  minimal,  not  a  minimum. 

But  consider  the  role  of  the  non-minimized  symbols  in  the  circumscription.  In 
the  simplest  version  of  circumscription,  used  in  the  previous  chapter,  these  symbols 
must  have  the  same  denotation  in  comparable  models.  But  the  more  general  theory  of 
circumscription  [McCarthy  85]  allows  one  to  specify  that  certain  predicates,  functions, 
and/or  constants  vary  during  the  minimization.  Models  may  be  comparable  even  if  they 
do  not  agree  on  those  symbols.  The  definition  of  minimum  cardinality  above  compares 
models  without  regard  to  their  agreement  on  any  symbols.  One  could  try  to  represent 
this  process  by  circumscribing  a  predicate  where  all  other  symbols  are  allowed  to  vary. 
Surprisingly,  this  works!  Under  easily  met  conditions,  minimizing  cardinality  is 
equivalent  to  setwise  minimization  where  all  predicates  and  functions  vary. 

Why  should  this  be  the  case?  Suppose  we  have  two  models,  Mi  and  M2, 
where  the  cardinality  of  the  extension  of  End  is  larger  in  Mi  than  it  is  in  M2,  yet  the 
extension  of  End  in  Mi  does  not  contain  the  extension  of  End  in  M2.  Mi  and  M2  are 
not  comparable.  But  there  must  be  a  model  that  "looks  like"  M2,  which  is  comparable 
to  Mi.  This  model  simply  swaps  the  roles  played  by  certain  domain  elements  in  all 
predicates  and  functions,  so  that  its  extension  of  End  is  a  proper  subset  of  Mi  's.  We 
can  precisely  define  how  this  can  be  done. 

4.2.1.  Circumscription  with  Variables 

The  circumscription  schema  can  be  modified  to  allow  symbols  to  vary.  The  circum¬ 
scription  of  ic  over  a  formula  S  where  a  varies  is  written  as 

Circum(S[ic,a],Jt,  a) 

which  abbreviates  the  second-order  formula 

S[it,a]  a  V  a,P  .  (S[o,P]  AO£ic)z>it£a 
In  the  schema,  a  stands  for  either  a  single  symbol  or  a  sequence  of  symbols. 
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4.2.2. 


Theorem  4.2  (Cardinality  Circumscription) 


Let  a  include  all  the  predicate,  function,  and  constant  symbols  in  our  language  other 
than  End.  Suppose  that  all  models  of  H  are  infinite,  and  in  some  model  of  T  u  cl(H), 
End  has  a  finite  extension.  If 

Circum(r  u  cl(H),  {End},  a)  I-  Cl 

then 


F  H  ^mc  ^ 

where  Circum(T  u  cl(H),  {End},  a)  means  to  circumscribe  with  a  varying.  The  "if' 
is  strengthened  to  "if  and  only  if'  if  it  is  true  that  circumscription  is  complete  in  this 
case. 


4.2.3.  Example  of  Cardinality  Circumscription 

Consider  the  following  simple  example  of  circumscribing  the  cardinality  of  a 
predicate.  The  formula  to  be  circumscribed  contains  only  the  predicate  End,  and  three 
constant  symbols.  It  asserts  that  either  a  is  End,  or  both  of  b  and  c  are. 

S[End,a,b,c]  »  a*b  a  a*c  a  b*c  a  (  End(a)  v  (  End(b)  a  End(c)  )  ) 

The  circumscription  of  End  in  S  where  the  constants  do  not  vary  simply  strengths  the 
disjunction  to  exclusive  or. 

Circum(  S[End,a,b,c],  End  )  * 

a*b  a  a*c  a  b*c  a  (  End(a)  ©  (  End(b)  a  End(c) )  ) 

This  is  because  there  is  a  minimal  model  where  both  b  and  c  are  End.  Now  try  circum¬ 
scribing  with  a,  b,  and  c  varying. 

Circum(  S[End,a,b,c],  End,  {a,  b,  c}  )  * 

S(End,a,b,c]  a 

Vp,x,y,z  .  (  S[p,x,y,z]  a  Vw  .  p(w)  3  End(w)  )  3 

Vw  .  End(w)  3  p(w) 

Instantiations  for  p,  x,  y,  and  z  are  chosen. 
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Let  p  *  (X  u  .  u  =b) 
x  *  b 
y-a 
z  =  c 

The  instantiated  schema  becomes: 

S[End,a,b,cj  a 

(  S[(X  u  .  u  -b),b,a,cj  a  End(b) )  3  Vw  .  End(w)  3  w=b 
Expanding  S[(X  u  .  u  =b),b,a,c]: 

S[End,a,b,c]  a 

(  b*a  a  b*c  a  a*c  a  (b=b  v  (a=b  a  c=b))  a  End(b)  )  3 
Vw  .  End(w)  3  w=b 

Simplify,  by  eliminating  the  atoms  in  the  second  main  conjunct  which  are  implied  by 
the  first: 

S[End,a,b,c]  a 

End(b)  3  Vw  .  End(w)  3  w=b 
Now  reason  by  cases.  Suppose  End(b).  Then 

Vw  .  End(w)  3  w»b 

Since  c*b,  this  means  -iEnd(c).  Together  with  S[End,a,b,c],  this  yields  End(a).  On 
the  other  hand,  suppose  -iEnd(b).  Then  again,  it  must  be  the  case  that  End(a). 

So  far  we’ve  shown  that  End(a)  is  a  consequence  of  the  circumscription.  Reinstantiat¬ 
ing  the  schema  strengthens  the  conclusion  to  that  a  is  the  only  member  of  End. 

Let  p  =  (X  u .  u  =a) 
x  =  a 

y  -  b 

z-c 

It  is  straightforward  to  show  that  the  instantiated  schema: 

S[End,a,b,c]  a 

(  S[(X  u  .  u  =a),a,b,c]  a  End(a) )  3  Vw  .  End(w)  3  w=a 
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implies 


End(a)  3  Vw  .  End(w)  3  w=a 

Thus,  circumscribing  End  with  a,  b,  and  c  varying  forces  End  to  have  cardinality  1: 
Circum(  S[End,a,b,c],  End )  f-  End(a)  a  Vw  .  End(w)  3  w=a 


4.3.  Example:  The  Cooking  World,  Continued 

We  continue  with  the  example  of  plan  recognition  in  the  cooking  world.  There 
are  two  observations.  The  first  is  described  in  the  previous  chapter  the  cook  is 
reported  to  be  making  spaghetti  or  fettucini.  We  concluded  that  the  cook  was  making 
some  kind  of  pasta  dish.  The  second  observation  is  that  the  cook  is  making  marinara 
sauce.  From  the  second  observation  alone  one  could  conclude  that  one  of  the  dishes 
involving  marinara  sauce,  namely  spaghetti  marinara  or  chicken  marinara,  was  being 
prepared.  If  we  allow  the  possibility  that  the  two  observations  are  unrelated,  then  all 
that  one  can  conclude  is  the  conjunction  of  the  two  conclusions;  namely, 

3x .  MakePastaDish(x)  a 

3y .  (MakeSpaghetdMarinara(y)  v  MakeChickenMarinara(y)) 

But  consider  what  holds  in  all  the  minimum  covering  models.  In  such  a  model,  there  is 
only  one  End  events  (in  the  present  example).  Each  of  the  variable;-  -  and  y  must  be 
interpreted  as  End  event,  since  the  types  MakePastaDish,  MakeSpagl  rttiMarinara,  and 
MakeChickenMarinara  all  specialize  End.  Therefore  x  and  y  must  be  interpreted  as  the 
same  entity.  The  disjointedness  assumptions  tell  us  that  an  entity  cannot  be  both  a 
MakePastaDish  and  a  MakeChickenMarinara.  Therefore  the  cook  must  be  making 
spaghetti  marinara  in  all  minimum  covering  models.  This  is  the  conclusion  me -entailed 
by  the  observations. 

Following  is  a  proof  sketch.  The  first  part  of  the  proof  appears  in  the  previous 
chapter. 

Second  Observation 

MakeMarinara(M2) 

Upward  Inference,  Existential  Instantiation 

MakeSpaghettiMarinara(’,,I2)  v  MakeChickenMarinara(*I2) 
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Abstraction 

MakePastaDish(*I2)  v  MakeMeatDish(*I2) 

Abstraction 

EndEvent(*I2) 

Conclusion  from  first  Observation  ( Chapter  3) 
EndEvcnt(*Il) 

Strongest  Minimality  Assumption 

V  x,y  .  EndEvent(x)  a  EndEvent(y)  o  x=y 

Universal  Instantiation  &  Modus  Ponens 
*11  -  *12 


Conclusion  from  first  Observation  ( Chapter  3) 
MakePastaDish(*Il) 


Substitution  of  Equals 

MakePastaDish(*I2) 

Disjointedness  Assumption 

V  x .  — iMakePastaDish(x)  v  -iMakeMeatDish(x) 

Disjunction  Elimination 

-iMakcMeatDish(*  12) 

Abstraction 

MateCMckenMariirara(*I2)  u  MakeMeatDish(*I2) 

Modus  Tolens 

-JSlakeChickenMarmara(*I2) 

Disjunction  Elimination 

MakeSpaghettiMarinara(*I2) 
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Chapter  5 

Incremental  Recognition 

A  recognition  problem  is  incremental  when  the  recognizer  is  presented  with  a 
temporal  sequence  of  observations,  T T2,  ...  •  At  any  point  in  the  sequence,  the 
recognizer  can  be  queried  as  to  the  consequences  of  the  observations  made  so  far. 

Suppose  mc-entailment  is  used  to  model  the  recognition  process.  Then  after 
observation  Tn,  the  recognizer  can  conclude  &  just  in  case 

I“i  u  ...  u  rn  jj  bmc  Q 

The  mc-entailment  relation  is  non-monotonic.  As  Tj  are  added  to  the  left-hand  side,  the 
set  of  conclusions  may  no  longer  include  £1  Consider  the  following  specific  case. 

Each  observation  is  of  the  occurrence  of  a  single  event  Each  observed  event 
must  be  a  component  of  an  End  event;  call  these  End  events  N^,  N2, . . .  Nn.  Let  there 
be  a  minimum  covering  model  for  Tj, ...  Tn  in  which  the  extension  of  End  has  cardi¬ 
nality  1;  that  is,  Ni»N2»...*Nn.  If  the  number  of  observations  is  of  reasonable  size, 
it  will  quite  often  be  the  case  that  there  is  only  a  single  specialization  of  End  which 
could  account  for  all  of  the  observations.  In  particular,  suppose  that 

n  u  ...  u  I"n  jj  me  ^  x  •  ^ic(x) 

Now  suppose  the  n+lst  observed  event  cannot  be  part  of  the  same  End  event  as  all  the 
previous  ones.  A  minimum  cover  for  Ti, ...  Tn+i  must  be  of  size  2.  The  recognizer 
can  no  longer  assume  that  Ni»N2»...*Nn.  Instead,  it  may  be  the  case  that  N 1  =Nn+ 1 , 
and  all  the  other  Nj  are  equal;  or  that  N2»Nn+i;  and  so  on.  There  may  be  as  many  as 
2n  different  ways  of  grouping  the  observations,  and  each  could  correspond  to  a  differ¬ 
ent  minimum  covering  modeL  The  conclusion  3x.Ek(x)  must  be  withdrawn,  and  re¬ 
placed  with  a  long  disjunction  of  different  specializations  of  End. 

5.1.  Deficiencies  in  the  MC  Model 

This  kind  of  non-monotonic  behavior  is  justified  as  a  normative  theory  of  what 
an  agent  should  come  to  believe,  if  the  minimization  of  End  events  is  the  only  basis  for 
combining  information  from  different  observations.  Objections  can  be  raised  on  the 
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grounds  of  complexity  and  psychological  plausibility.  The  theory  of  mc-entailment  can 
be  adjusted,  however,  to  both  have  a  more  efficient  computational  realization,  and  to 
more  accurately  reflex  some  of  our  intuitions  about  incremental  recognition  processes. 


5.1.1.  The  Combinatorial  Problem 

Once  the  minimum  cardinality  default  is  weakened  to  allow  two  or  more  distinct 
End  events,  the  number  of  ways  of  grouping  the  observations  together  as  components 
of  the  same  End  event  grows  exponentially.  In  some  domains  the  various  constraints 
associated  with  event  types  could  quickly  rule  out  most  of  these  possibilities.  But  it 
seems  clear  that  additional  principles  will  eventually  be  needed  to  deal  with  realistically- 
sized  problems. 

5.1.2.  The  Persistence  Problem 

Related  to  the  combinational  problem  is  the  failure  of  mc-entailment  to  match  up 
with  our  intuitions  about  the  way  a  person  would  go  about  solving  an  incremental 
recognition  problem.  Once  several  pieces  of  information  are  "tied  together",  it  seems 
unnatural  to  break  that  connection,  simply  because  some  seemingly  unrelated  piece  of 
information  comes  along. 

Consider  the  following  (very!)  short  story,  bearing  in  mind  the  event  hierarchy 
in  the  beginning  of  Chapter  1. 

Leo  needed  some  cash.  He  took  out  his  shotgun.  Then 
he  went  to  the  bank.  Later  that  day,  he  was  seen  in  the 
woods. 

As  we  read  along,  the  descriptions  of  the  Leo's  getting  a  gun  and  going  to  the  bank, 
together  with  his  state  of  needing  money,  invoke  the  bank  robbery  event  (or  plan  or 
script  or  ...).  Walking  in  the  woods  is  not  part  of  robbing  a  bank.  None  the  less,  the 
story  still  seems  to  be  about  a  bank  robbery;  we  don't  consider  the  possibility  that  Leo 
got  his  gun  in  order  to  go  hunting  in  the  woods.  After  the  first  three  sentences,  we 
conclude  something  like 

3  x  .  RobBankfx)  a  agent(x)=Leo 

and  this  belief  persists  even  after  the  last  sentence.  Mc-entailment  would  justify  this 
conclusion  after  the  first  three  sentences,  but  after  the  last  would  only  justify  the  weaker 
conclusion: 
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3  x,y  • 

(RobBank(x)  a  (Hunt(y)  v  Go-Hilring(y))  v 
(CashCheck(y)  a  Hunt(x)) 

What  is  going  on?  One  explanation  is  that  we  are  minimizing  End  events  in  a 
incremental  fashion.  If  we  have  to  increase  the  number  of  End  events  to  account  for  the 
story,  we  do,  but  we  try  not  to  discard  any  previous  conclusions.  Of  course,  new  in¬ 
formation  could  force  us  to  withdraw  a  previous  conclusion.  So  if  the  story  continued 

Leo  had  spent  a  long  time  in  line  ax  the  bank,  waiting  to 
cash  his  paycheck.  He  was  glad  to  finally  be  out  stalk - 
ing  the  wild  moose. 

the  earlier  conclusion  is  eliminated.  But  such  stories  appear  either  awkward  or  deliber¬ 
ately  misleading  (as  a  mystery  story  should  be);  making  an  analogy  with  syntax,  one 
might  call  them  garden-path  stories. 1 

•  This  manner  of  reasoning  is  understandable  if  one  views  the  times  between  ob¬ 
servations  as  points  at  which  the  recognizer  accepts  his  non-deductive  conclusions  as 
full  beliefs.  These  beliefs,  together  with  whatever  else  the  recognizer  believes,  are 
subject  to  minimal  revision  as  contradictory  information  is  learned.  But  new  non-de¬ 
ductive  conclusions  are  not  (in  this  model)  die  basis  for  revising  old  beliefs. 

The  difficulty  with  this  model  is  that  the  minimum  cardinality  defaults  are  al¬ 
ways  revised  when  the  number  of  End  events  must  be  increased.  How  then  can  con¬ 
clusions  based  on  these  assumptions  be  retained?  Section  5.3  below  develops  a  model 
theory  which  exhibits  this  behavior,  by  separating  existential  and  universal  conclusions: 
the  former  may  persist  while  the  latter  are  rejected. 

5.2.  Refining  the  Model 

This  section  considers  a  number  of  factors  that  could  go  into  a  theory  of  incre¬ 
mental  recognition. 

5.2.1.  Incrementally  Minimize  Cardinality 


*  A  garden-path  sentence  is  a  syntactically  "cotrect"  sentence  which  is  difficult  to  understand,  because 
the  hearer  makes  incorrect  decisions  about  its  constituent  structure  before  the  end  of  the  sentence  is 
reached.  The  standard  example  is  the  sentence,  "The  horse  raced  past  the  bam  fell." 
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Both  the  persistence  problem  and  some  aspects  of  the  complexity  problem  are 
addressed  by  incrementally  minimizing  the  number  of  End  events,  as  described  above. 
Most  of  the  plan  recognition  systems  described  in  the  literature  perform  an  incremental 
minimization,  usually  together  with  some  of  the  following  heuristics  and  constraints. 
Chapter  7  describes  an  algorithm  for  a  plan  recognizer  which  implements  this  heuristic. 

5.2.2.  Sticky  Covers 

A  heuristic  for  grouping  observations  described  in  [Huff  &  Lesser  82]  is  to  as¬ 
sociate  each  new  observation  with  the  most  recent  observation  with  which  it  can  con¬ 
sistently  grouped.  We  call  this  the  sticky  covering  assumption,  since  each  observation 
tries  to  "stick"  to  the  previous  one.  The  sticky  heuristic  solves  the  combinatorial  prob¬ 
lem,  as  discussed  in  detail  in  Section  7.9.3,  but  is  even  more  sensitive  to  garden-path 
type  errors.  An  algorithm  for  calculating  conclusions  true  in  all  sticky  covers  is  given 
in  Chapter  7,  but  we  have  not  tried  to  work  out  a  corresponding  model  theory. 

5.2.3.  Discourse  Clues 

Much  work  in  computational  models  of  discourse  attempts  to  find  clues  which 
indicate  how  the  various  utterances  which  make  up  an  extended  discourse  should  be 
grouped.  These  take  into  consideration  such  features  as  "clue  words"  (such  as  "but" 
and  "therefore")  [Grosz  77]  and  intonation  [Pierrehumbert  77]. 

5.2.4.  Likelihood  Associations 

Eventually  well  need  to  deal  with  the  frequency  with  which  an  event  appears  as 
a  component  of  its  different  uses.  The  less  common  uses  may  be  ignored  if  possible. 
Such  a  quantitative  approach  is  not  antithetical  to  the  work  described  here.  Once  our 
qualitative  methods  choose  all  the  possible  interpretations  of  the  data  under  some  set  of 
simplicity  heuristics,  probabilistic  methods  can  be  applied  to  rank  and  choose  between 
the  alternatives.  This  is  exactly  the  approach  taken  by  expert  systems  which  employ 
notions  similar  to  covering  models,  as  described  in  Sections  1.4  and  6.4. 

5.3.  Model  Theory  for  Incremental  Minimization 

The  section  defines  a  weaker  version  of  minimum  covering  model,  and  then 
uses  it  to  define  an  incremental  operator,  imc-entailment 
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5.3.1. 


Minimum  Covering  Submodels 


Mc-entailment  justifies  some  kinds  of  conclusions  that  one  would  not  ordinarily 
draw  from  a  set  of  observations.  Suppose  that  one  observes  A  and  B,  and  C  is  the 
only  End  act  which  entails  both  A  and  B.  Then  would  seem  reasonable  to  conclude  C, 
and  mc-entailment  justifies  this  conclusion.  However,  mc-entailment  also  justifies  the 
conclusion  that  no  End  act  other  than  C  occurs;  and  that  assumption  was  the  basis  for 
the  previous  conclusion.  Yet  the  statement  that  only  one  event  will  ever  occur,  at  all 
times  now  and  in  the  future,  is  almost  surely  false.  The  problem  is  that  our  intuitively 
acceptable  conclusion  C,  has  the  same  status  as  the  dubious  conclusion  that  nothing 
else  will  ever  happen. 

Simply  eliminating  the  cardinality  minimization  would  eliminate  the  doubtful 
conclusion,  but  at  the  cost  of  any  combination  of  information  from  multiple  observa¬ 
tions.  A  more  attractive  approach  is  to  weaken  the  notion  of  mc-entailment.  Instead 
of  minimizing  the  number  of  End  events  per  se,  we  really  only  need  to  minimize  the 
number  of  events  which  account  for  all  the  observations.  There  may  be  other  End 
events  which  occur  at  other  times  and  places,  but  they  do  not  involve  the  current  set  of 
observed  events.  Instead  of  drawing  conclusions  based  on  the  class  of  minimum  cov¬ 
ering  models,  one  should  draw  conclusions  based  on  the  .larger  class  of  models  which 
contain  a  minimum  covering  submodel. 

Theorem  5.1  states  that  all  the  existential  conclusions  that  one  can  obtain  by 
considering  minimum  covering  models  can  also  be  obtained  by  considering  the  larger 
class.  This  will  provide  the  basis  for  retaining  conclusions  such  as 

3  x  .  RobBank(x)  a  agent(x)«Leo 
while  dropping  the  assumption  that 

V  x,y  .  End(x)  a  End(y)  3  x»y 

5.3. 1.1.  Definitions 

Mi  is  a  submodel  of  M2  if  the  domain  of  Mi  is  a  subset  of  M2;  Mi  and  M2  agree  on 
all  predicates  for  tuples  consisting  only  of  elements  in  and  Mi  and  M2  agree  on  the 
interpretation  of  all  functions,  restricted  to  the  domain  of  If  Mi  is  both  a  covering 
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model  of  T  relative  to  H,  and  has  a  submodel  M2  which  is  a  minimum  cover  for  T  rela¬ 
tive  to  H,  we  say  contains  a  minimum  cover  for  r  relative  to  H. 

ft  is  mcs-entailed  by  r,  written 
r  jj  h mcs  & 

if  ft  holds  in  all  models  which  contain  a  minimum  cover  of  I*  relative  to  H.  For  brevi¬ 
ty,  we  will  call  a  model  which  contains  a  minimum  cover  an  mcs-model. 

5.3. 1.2.  Theorem  5.1  (Non-Uni versal  Conclusions) 

Let  £1  be  a  sentence  which,  when  written  with  all  quantifiers  in  initial  position,  contains 
no  universal  quantifiers.  Then 

F  h  ^ mcs  ^ 

if  and  only  if 

^  g  f'mc  ^ 

5.3.2.  Monotonic  Incremental  Recognition 

The  intuition  behind  a  monotonic  theory  of  incremental  recognition  is  that  after 
the  recognizer  obtains  a  piece  of  evidence,  he  believes  forever  the  consequences  of  that 
evidence.  This  section  defines  the  final  operator  which  relates  a  sequence  of  observa¬ 
tions  to  its  incrementally  minimum  covering  entailed  conclusions.  Theorem  5.2  states 
that  this  operator  has  the  desired  properties  of  acting  like  mc-entailment  when  there  is 
only  one  observation,  and  allowing  existential  conclusions  to  persist  as  new  observa¬ 
tions  are  made. 

5.3.2. 1.  Definitions 

Let  F  -  (Ti,  T2,  ...  rn)  be  an  observation  sequence.  Following  is  a  recursive 
definition  of  the  class  of  incremental  minimal  covers  of  T.  Let  us  say  that  M  is  a 
n-candidate  if  M  is  a  covering  model  of  T  relative  to  H,  and  (if  n>l)  M  is  an  incre¬ 
mental  minimum  cover  of  (I*i,  T2, ...  Tn).  Then  M  is  an  incremental  minimal  cover 
of  T  reladve  to  H  if  M  is  an  n-candidate,  and: 
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(i)  n*  1 ,  and  M  contains  a  minimum  cover  for  T  relative  to  H. 


(ii)  or  n  >  1,  and  M  contains  a  submodel  M2  which  has  minimum  cardi¬ 
nality  in  End  among  covering  models  of  r  which  are  submodels 
of  n-candidatcs. 

Q  is  line-entailed  by  (Tj,  T2, ...  rn),  written 
(1*1*  ^2*  •••  I*n)  jj  hime  Q 

if  Q  holds  in  all  models  which  are  incremental  minimum  covers  of  (Ql,  Q2,  ...  Qn) 
relative  to  H. 

5.3. 2.2.  Theorem  5.2  (Incremental  Recognition) 

In  the  case  of  a  single  observation,  imc-entailment  is  the  same  as  mcs-entailment 

ff*l)  H  Q 

if  and  only  if 

1*1  jj  Is mes  ^ 

In  the  case  of  multiple  observations,  imc-entailment  is  monotonic. 

(1*1.  •••  I*n_i)  ^ime  ^ 

implies 

(1*1*  •••  r n-i,  Fn)  pj  hjmc  ^ 


7A-D-  75 


Chapter  6 
Examples 

6. 1 .  The  Cooking  World,  Once  More 

The  examples  at  the  end  of  Chapters  3  and  4  illustrated  inference  from  a  dis¬ 
junctive  observation,  and  the  combination  of  observations  to  uniquely  identify  the  plan 
in  progress.  The  next  example  illustrates  two  other  important  features  of  our  system: 
the  ability  to  handle  disjunctive  hypotheses,  and  to  draw  conclusions  that  reflect  an  ab¬ 
stract  description  of  a  class  of  possible  plans. 

Suppose  that  the  observer  learns  that  the  cook  is  preparing  some  kind  of  sauce. 
This  is  not  enough  information  to  conclude  a  particular  basic  dish  is  being  created:  one 
cannot  (as  appears  to  be  the  case  in  most  of  the  plan  recognition  systems  described  in 
Chapter  1)  imagine  that  a  single  plan  is  evoked.  Instead,  one  is  justified  in  concluding 
that  the  cook  is  making  some  Pasta  Dish,  or  Chicken  Marmara.  This  disjunction  col¬ 
lapses,  via  the  abstraction  axioms,  to  the  fact  that  an  event  of  type  PrepareMeal  is  oc¬ 
curring.  This  final  piece  of  information,  non-specific  as  it  is,  may  still  be  of  great 
interest:  for  instance,  if  we  are  hungry!  The  example  also  illustrates  the  importance  of 
adopting  a  formal  framework  that  guarantees  completeness:  all  possible  uses  of  an 
event  are  considered.  A  noted  in  Chapter  3,  the  only  direct  use  of  MakeSauce  is 
MakePastaDish.  It  is  not  clear  whether  any  of  the  earlier  heuristic-based  plan  recogni¬ 
tion  systems  would  consider  MakeChickenMarinara,  or  if  they  would  immediately  (and 
unjustifiably)  jump  to  the  first  conclusion.  The  proof  so  far  is: 

Observation 

MakeSauce(Obsl) 

Component  Use,  Existential  Instantiation 

MakePastaDish(*Il)  v  MakeChickenMarinara(*Il) 

Abstraction 

MakePastaDish(*Il)  v  MakeMeatDish(*Il) 

PrepareMeal(*Il) 

End(*Il) 
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Now  let  the  next  observation  be  that  the  agent  is  making  Noodles,  and  assume  the  End 
event  inferred  from  the  first  observation  is  the  same  as  the  End  event  inferred  from  the 
second.  Mc-entailment  lets  us  conclude  that  a  plan  to  make  some  Pasta  Dish  is  in 
progress.  We  do  not  need  (cannot,  in  fact)  conclude  that  a  particular  kind  of  Pasta 
Dish  is  under  construction. 

Second  Observation 

MakeNoodles(Obs2) 

Component/Use,  Existential  Instantiation 
MakePastaDish(*I2) 

Abstraction 

PrepareMeal(*I2) 

End(*I2) 

Strongest  Minimality  Assumption 

V  x,y  .  End(x)  a  End(y)  3  x*y 

Universal  Instantiation  &  Modus  Ponens 

*11  -  *12 


Substitution  of  Equals 

MakePastaDish(*Il) 

Disjointedness  Assumption 

V  x .  -iMakePastaDish(x)  v  -iMakeMeatDish(x) 

Disjunction  Elimination 

-iMakeMeatDish(*Il ) 

Abstraction 

MakeChickenMarinara(*Il)  z>  MakeMeatDish(*Il) 

Modus  Tolens 

— iMakcChickenMarinara(*I  1 ) 

Disjunction  Elimination 

MakePastaDish(*Il) 
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Another  example  from  this  domain  appears  in  Chapter  7  and  in  Appendix  E 
(Transcripts).  That  example  shows  how  one  observation  can  constrain  a  different  ob¬ 
servation  to  be  of  specialized  in  a  particular  way.  Specifically,  a  later  observation  of 
MakeMarinara  constrains  an  earlier  observation  of  MakeNoodles  to  be  in  fact  an  in¬ 
stance  of  MakeSpaghetti.  Information  can  flow  from  one  observation  to  another. 

6.2.  Indirect  Speech  Acts 

A  proper  treatment  of  discourse  requires  careful  attention  to  the  representation 
of  beliefs  and  intentions,  as  well  as  to  the  relation  of  an  utterance  to  its  meaning.  An 
fully  adequate  treatment  therefore  necessitates  the  use  of  some  sort  of  intensional  or 
modal  logic.  None  the  less,  we  will  try  to  approximate  the  problem  of  recognizing  a 
speech  act  using  the  tools  at  hand,  in  order  to  suggest  how  the  present  approach  might 
be  extended  to  the  discourse  domain. 

6.2.1.  Representation 

Recall  that  properties  are  represented  by  terms.  The  property  that  an  agent  A 
knows  P  is  simply  represented  by  applying  the  function  know  to  terms  for  the  agent 
and  property  P.  The  function  can  relates  an  agent  and  a  property  the  agent  can  bring 
about  We  assume  that  for  every  act  (event)  type,  there  is  a  property  which  holds  just 
after  an  instance  of  the  event  occurs.  Thus  the  fact  that  John  knows  at  time  T1  that 
Mary  can  give  John  the  salt  might  be  written: 

Holds(know(John, can  (Mary, gavc(MaryJohn,salt))),  T  i) 

Finally  we  need  the  function  knowif,  which  relate  an  agent  and  a  property  whose  truth 
value  the  agent  knows.  Many  objections  can  be  raised  to  this  notation,  such  as  its  ref¬ 
erential  transparency,  and  the  omission  of  time  indexes  on  the  objects  of  beliefs,  but  it 
suffices  for  the  example. 

The  following  diagram  shows  part  of  the  event  hierarchy.  The  types  Sur- 
facelmperadve  and  SurfaceQuestion  classify  utterances  of  commands  and  questions, 
respectively.  Request  and  Informlf  are  speech  act  event  type.  A  request  can  be  spe¬ 
cialized  as  a  Command,  a  DirectRequest,  or  an  IndirectRequest  Each  of  these  special¬ 
izations  corresponds  to  different  decomposition  of  the  Request  speech  act,  such  as  ap¬ 
pears  in  (Litman  843.  Some  of  the  discourse  plans  which  employ  speech  acts  are  Ob- 
tainBy Asking  and  FindOutByAsking.  These  discourse  plans  specialize  more  general 
methods  for  obtaining  objects  and  finding  out  information.  There  may  be  a  great  deal 
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of  structure  above,  or  the  discourse  plans  themselves  may  be  taken  as  End  events,  and 
thus  terminate  analysis.  The  example  here  does  not  depend  on  any  higher  structure  in 
the  library. 
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figure  6.1:  Language  Use  Hierarchy 

Following  is  an  abbreviated  list  of  the  axioms  for  this  hierarchy.  All  the  actions 
have  roles  of  agent  and  time,  and  other  roles  as  specified. 

•  Non-linguisric  Acts:  Obtain  has  the  role  of  the  object  to  be  obtained.  During  pre¬ 
Time  the  agent  does  not  have  the  object;  during  postTime  the  agent  does.  Find- 
Out  has  the  role  info,  a  property,  whose  truth-value  is  to  be  determined.  During  pre¬ 
Time  the  agent  does  not  knowif  the  info  is  true  or  false;  during  postTime  the 
agent  does. 


Vx .  Obtain(x)  3 

Never(have(agent(x),obj(x)),  preTime(x))  a 
meets(preTime(x),  time(x))  a  ... 

Hold$(have(agent(x),obj(x)),  postTime(x))  a 
meets(time(x),  postTime(x))  a  ... 

Vx .  FindOut(x)  3 

Never(knowif(agent(x),  info(x)),  preTime(x))  a 
meets(preTime(x),  time(x))  a 
Holds(knowif(agent(x),  info(x)),  postTime(x))  a 
meets(time(x),  postTime(x))  a  ... 

•  Discourse  Acts:  The  decomposition  of  ObtainByAsking  states  that  the  agent  per¬ 
forms  a  Request,  whose  reqGoal  is  that  the  hearer  gave  the  agent  the  object.  This 
step,  si,  is  followed  by  step  s2,  in  which  the  hearer  gives  the  agent  the  object. 

Vx .  ObtainByAslting(x)  3 
Requests  l(x))  a 

reqGoaI(sl(x))  »  gave(heaier(sl(x)),  agent(x),  obj(x))  a 
Give(s2(x))  a  ... 

FindOutBy Asking  is  veiy  similar,  the  decomposition  states  that  the  agent  performs  a 
Request,  whose  reqGoal  is  that  the  hearer  has  informed  the  agent  of  the  truth  value 
of  the  info.  In  s2  the  affected  performs  an  Informlf  to  the  agent. 

Vx .  FindOutBy Asking(x)  3 
Requests  l(x))  a 

reqGoal(sl(x))  *  informedIf(hearer(sl(x)),  agent(x),  info(x))  a 
InformIf(s2(x))  a  ... 

•  Speech  Acts:  Each  of  the  specializations  of  a  Request  contains  a  single  utterance  act 
in  its  decomposition.  The  single  step  in  a  DirectRequest  is  x  surface  yes/no  question. 
The  reqGoal  of  the  DirectRequest  must  be  that  the  affected  has  informed  the  agent 
of  the  truth  value  of  the  (propositional)  content  of  the  SurfaceQuestion. 
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Vx .  DircctRequest(x)  z> 

SurfaceQuestion(s(x))  a 

reqGoal(x)  « informedIf(hearer(x),  speaker(x),  content(s(x)))  a  ... 

The  single  step  of  an  IndirectRequest  is  also  a  SurfaceQuestion;  however,  there  is  a 
different  relation  between  the  content  of  the  question  and  the  reqGoal  of  the  request. 
The  question  must  be  a  "can"  question,  of  the  form,  "can  the  affected  bring  about  the 
goal  of  the  request?" 

Vx .  IndirectRequest(x)  3 

SuifaceQuestion(s(x))  a 

content(s(x))  »  can(hearer(x),  reqGoal(x))  a  ... 

6.2.2.  Assumptions 

The  following  component/use  axioms  are  generated  by  assuming  that  the 
hierarchy  is  complete.  Whenever  a  SurfaceQuestion  occurs,  it  must  be  part  of  a  Di- 
rectRequest  or  an  IndirectRequest;  and  whenever  a  Request  occurs,  it  must  be  part  of 
an  ObtainByAsking  or  FindOutByAsking. 

Vx .  SurfaceQuesdon(x)  3 

(3y  .  DirectRequest(y)  a  x=s(y))  v 
(3y  .  IndirectRequest(y)  a  x»s(y)) 

Vx .  Request(x)  3 

(3y  .  ObtainByAsking(y)  a  x»sl(y))  v 
(3y  .  FindOutByAsking(y)  a  x»sl(y)) 

6.2.3.  The  Problem 

Suppose  S  says  to  H,  "Can  you  give  me  the  salt?'  Real  world  knowledge  (part 
of  Hq)  includes  the  statement  that  at  all  times,  S  knows  whether  or  not  H  can  give  S 
the  salt* 


Vt .  Hold(knowIf(S,  can(H,  gave(H,S,salt))),  t) 

An  instance  of  a  SurfaceQuestion,  Ql,  occurs,  with  the  content,  "H  can  give  S  the 
salt”. 
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SurfaceQuestion(Ql)  a  speaker(Ql)  =  S  a  hearer(Ql)  =  H  a 
content(Ql)  =  can(H,  gave(H,S,salt)) 

Now  apply  the  uoward  inference  assumption  for  SurfaceQuestion.  It  must  be  case  that 
Q1  is  either  a  step  of  some  DirectRequest  (call  it  *R1)  or  some  IndirectRequest  In  ei¬ 
ther  case  the  constraints  can  be  used  to  determine  the  possible  reqGoals.  In  the  direct 
case,  the  goal  must  be  for  H  to  tell  S  whether  or  not  H  can  give  S  the  salt  In  the  indi¬ 
rect  case,  the  goal  must  be  for  H  to  give  S  the  salt 

DirectRequest(*Rl)  aQU  s(*Rl)  a 
reqGoal(*Rl)  =  informedIf(H,S,  can(H,  gave(H,S,salt))) 
v 

IndirectRequest(*Rl)  a  Ql  =  s(*Rl)  a 
reqGoal(*Rl)  -  gave (H,S, salt) 

Neither  alternative  can  (yet)  be  eliminated  on  the  basis  of  inconsistent  constraints.  So 
we  apply  the  second  upward  closure  assumptions  to  this  statement,  yielding  a  four- way 
disjunction  describing  the  act  *R2  which  has  step  *R1.  Constraint  information  can 
now  be  used  to  eliminate  all  but  one  alternative. 

FindOut  Action 

Direct  Request :  Precondition  Fails 

*R1  is  a  direct  request,  and  the  goal  of  *R1  is  to  be  informed  if 
H  can  pass  the  salt.  *R1  is  the  first  step  of  the  FindOut  action. 
However,  the  constraint  that  S  does  not  know  if  H  can  pass  the 
salt  before  the  action  occurs  is  provably  false. 
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Indirect  Request :  Ill-Formed 

*R1  is  an  indirect  request,  and  the  goal  of  *R1  is  for  H  to  pass 
the  salt  *R1  is  the  first  step  of  the  FindOut  action.  However, 
to  be  part  of  FindOut,  the  goal  of  *R1  must  be  of  the  form  in- 
formedIf(-),  instead  of  gave(-).  Therefore  this  alternative  is  in¬ 
formed. 


Obtain  Action 

Direct  Request :  Ill-Formed 

*Ri  is  a  direct  request,  and  the  goal  of  *R1  is  to  be  informed  if 
H  can  pass  the  salt  *R1  is  the  first  step  of  the  Obtain  action. 
However,  to  be  part  of  Obtain,  the  goal  of  *R1  must  be  of  the 
form  gave(-),  instead  of  informedIf(-).  Therefore  this  alterna¬ 
tive  is  ill-formed. 

Indirect  Request :  Accepted 

*R1  is  an  indirect  request  and  the  goal  of  *R1  is  for  H  to  pass 
the  salt  *R1  is  the  first  step  of  the  Obtain  action.  This  alterna¬ 
tive  is  well  formed,  and  no  violated  constraints  can  be  found. 

The  follow  statement  encodes  this  analysis.  An  X  appears  at  the  point  in  each  alterna¬ 
tive  where  an  inconsistency  is  detected.  The  ill-formed  alternatives  are  ruled  out  by  the 
property  identity  axioms  discussed  in  Section  2.2. 

FindOutByAslring(*R2)  a  *R1  »  sl(*R2)  a 
DixectRequest(*Rl)  a  Q1  *  s(*Rl)  a 
reqGoal(*Rl)  *  informedIf(H,S,  can(H,  gave(H,S,salt)))  a 
reqGoal(sl(*R2))  =*  informedIf(H,S,  info(*R2))  a 
info(*R2)  *  can(H,  gave(H,S,salt))  a 
X  Never(  knowif(S,  can(H,  gave(H,S,salt))),  preTime(*R4)) 
v 

FindOutByAsldng(*R2)  a  *R1  »  sl(*R2)  a 

IndirectRequest(*Rl)  a  Q1  »  s(*Rl)  a 
reqGoal(*Rl) »  gave(H,S,salt) 

X  reqGoal(sl(*R2))  »  informedIf(H,S,  info(*R2))  a 
v 
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ObtainByAsking(*R2)  a  *R1  =  sl(*R2)  a 
DirectReque$t(*Rl)  a  Q1  ■  s(*Rl)  a 
reqGoal(*Rl)  »  informedIf(H,S,  can(H,  gave(H,S,salt)))  a 
X  reqGoal(sl(*R2))  »  gave(H,S,  obj(*R2))  a 
v 

ObtainByAsking(*R2)  a*R1»  sl(*R2)  a 
IndirectRequest(*Rl)  a  Ql  *  s(*Rl)  a 
reqGoal(*Rl)  *  gave(H,S,salt)  a 

reqGoal(sl(*R2)) «  gave(H,S,object(*R2))  a 
✓  object(*R2) » salt 

The  final  disjunct  must  be  true:  The  recognizer  is  justified  in  concluding  that  S  is  per¬ 
forming  the  plan  to  obtain  the  salt  by  asking  for  it 

ObtainByAsking(*R2)  a  object(*R2)  *  salt  a  *R1  *  sl(*R2)  a 
IndirectRequest(*Rl)  a  reqGoal(*Rl)  »  gave(H,S,salt)  aQU  s(*R1)  a 
SurfaceQuestion(Ql)  a  content(Ql)  -  can(H,  gave(H,S,salt)) 

6.3.  Operating  Systems:  Multiple  Events 

Several  research  groups  have  examined  the  use  of  plan  recognition  in  "smart" 
operating  systems,  which  could  answer  user  questions  and/or  watch  what  the  user  was 
doing,  and  make  suggestions  about  potential  pitfalls  and  more  efficient  ways  of  accom¬ 
plishing  the  same  tasks.  A  user  often  works  on  several  different  tasks  during  a  single 
session  at  the  terminal,  and  frequently  jumps  back  and  forth  between  uncompleted 
tasks.  Therefore  a  plan  recognition  system  for  this  domain  must  be  able  to  handle  mul¬ 
tiple  concurrent  unrelated  plans.  The  very  generality  of  the  present  approach  is  an  ad¬ 
vantage  in  this  domain,  where  the  focus-type  heuristics  used  by  other  plan  recognition 
systems  are  not  so  applicable. 

* 

6.3.1.  Representation 
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figure  62:  Operating  System  Hierarchy 

Consider  the  following  hierarchy.  There  are  two  End  plans:  to  Rename  a  file, 
and  to  Modify  a  file. 


Vx  .  Rename(x)  3  End(x) 


Vx .  Modify(x)  3  End(x) 

There  are  two  ways  to  specialize  the  Rename  event  A  RenameByCopy  involves 
Copying  a  file,  and  then  Deleting  the  original  version  of  the  file,  without  making  any 
changes  in  the  original  file. 

Vx  .  RenameByCopy(x)  3  Rename(x) 
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Vx .  RenameByCopy(x)  3 
Copy(sl(x))  a 
Delete(s2(x))  a 
old(sl(x)) -  old(x)  a 
new(sl(x))  -  new(x)  a 
ffle(s2(x))  -  old(x)  a 
BeforeMeet(tiine(sl(x)),  time(s2(x)))  a 
Never(modified(old(x)),  time(x))  a 

Starts(time(sl(x)),  time(x))  a 
Finishcs(timc(s2(x)),  time(x)) 

A  better  way  to  rename  a  file  is  to  RenameByMove,  which  simply  uses  the  Move  com¬ 
mand.  A  helpful  system  might  suggest  that  a  user  try  the  Move  command  if  it  recog¬ 
nizes  many  instances  of  RenameByCopy. 

Vx .  RenameByMove(x)  3  Rename(x) 

Vx .  RenameByMove(x)  3 
Move(sl(x))  a 

old(sl(x)) »  old(x)  a 
new(sl(x))  »  new(x) 

The  Modify  command  has  three  steps.  In  the  first,  the  original  file  is  backed  up  by 
Copying.  Then  the  original  file  is  Edited.  Finally,  the  backup  copy  is  deleted. 

Vx .  Modify(x)  3 

Copy(sl(x))  a 
Edit(s2(x))  a 
Delete(s3(x))  a 
old(sl(x)) »  file(x)  a 
file($2(x)) «  file(x)  a 
file(s3(x)) «  new(sl(x))  a 

BeforeMeet(time(sl(x)),  time(s2(x)) )  a 
BeforeMeet(time(s2(x)),  time(s3(x))) 

6,3.2.  Assumptions 
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Following  are  some  of  the  statements  obtained  by  minimizing  the  hierarchy. 
The  Component/Use  assumptions  include  the  statement  that  every  Copy  acdon  is  either 
part  of  a  RenameByCopy  or  of  a  Modify. 

Vx .  Copy(x)  d 

(3y  .  RenameByCopy (y)  a  x»sl(y))  v 
(3y  .  Modifyfy)  a  x-sl(y)) 

Every  Delete  event  is  either  the  second  step  of  a  RenameByCopy,  or  the  third  step  of  a 
Modify,  in  any  covering  modeL 

Vx .  Delete(x)  3 

(3y  .  RenameByCopy(y)  a  x«s2(y))  v 
(3y  .  Modifyfy)  a  x«s3(y)) 

6.3.3.  The  Problem 

Suppose  the  plan  recognition  system  observes  each  action  the  user  performs. 
Whenever  a  new  file  name  is  typed,  the  system  generates  a  constant  with  the  same 
name,  and  asserts  that  that  constant  is  not  equal  to  any  other  file  name  constant  (We 
don't  allow  UNIX™  style  "links"!)  During  a  session  the  user  types  the  following 
commands. 


(1)  %  copy  foo  bar 

(2)  %  copy  jack  sprat 

(3)  %  delete  foo 

The  system  should  recognize  two  concurrent  plans.  The  first  is  to  rename  the  file  "foo" 

to  "bar".  The  second  is  to  either  rename  or  modify  the  file  "jack".  Let's  examine  how 

these  inferences  could  be  made. 

Statement  (1)  is  encoded: 

Copy(Cl)  a  old(Cl)*foo  a  new(Cl)*bar 

The  decomposition  completeness  axiom  for  Copy  lets  the  system  infer  that  Cl  is  either 
part  of  a  RenameByCopy  or  Modify.  A  new  name  *11  is  generated  (by  existential  in¬ 
stantiation)  for  the  disjunctively-described  event. 
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End(*Il)  a 

(  (RenameByCopy(*Il)  a  Cl»sl(*Il) ) 

v 

(Modify(*Il)  a  Cl=sl(*Il) ) 

) 

Statement  (2)  is  encoded: 

Copy(C2)  a  old(C2)»jack  a  new(C2)=sprat  a  Before(time(Cl),  time(C2)) 

Again  the  system  creates  a  disjunctive  description  for  the  event  *12,  which  has  C2  as  a 
component 

End(*I2)  a 

(  (RenameByCopy(*I2)  a  C2»sl(*I2) ) 

v 

(Modify(*I2)  a  C2-sl(*I2) ) 

) 

The  next  step  is  to  minimize  the  number  of  End  events.  The  system  might  attempt  to 
apply  the  strongest  minimization  default,  that 

Vx,y  .  End(x)  a  End(y)  3  x»y 

However,  doing  so  would  lead  to  a  contradiction.  Because  the  types  RenameByCopy 
and  Modify  are  disjoint  *I1»*I2  would  imply  that  C1=C2;  however,  the  system 
knows  that  Cl  and  C2  are  distinct  ~  among  other  reasons,  their  times  are  known  to  be 
not  equal.  The  next  strongest  minimality  default  that  there  are  two  End  events,  cannot 
lead  to  any  new  conclusions. 

Statement  (3),  the  act  of  deleting  "foo",  is  encoded: 

Delete(C3)  a  file(C3)-foo  a  Before(time(C2),  time(C3)) 

The  system  infers  by  the  decomposition  completeness  assumption  for  Delete  that  the 
user  is  performing  a  RenameByCopy  or  a  Modify.  The  name  *13  is  assigned  to  the  in¬ 
ferred  event 


7A-D-  88 


End(*D)  a 

(  (RenameByCopy(*I3)  a  C3=s2(*I3)  ) 
v 

(Modify(*I3)  a  C3-s3(*I3)  ) 

) 

Again  the  system  tries  to  minimize  the  number  of  End  events.  The  second  strongest 
minimality  default  says  that  there  are  no  more  than  two  End  events. 

Vx,y,z  .  End(x)  a  End(y)  a  End(z)  z> 
x-y  v  x-z  v  y-z 

In  this  case  the  formula  is  instantiated  as  follows. 

*11**12  v  *I1«*I3  v  *12**13 

We've  already  explained  why  the  first  alternative  is  impossible.  Thus  the  system 
knows 


*I1-*D  v  *12-* B 

The  system  then  reduces  this  disjunction  by  reasoning  by  cases.  Suppose  that 
*12**13.  This  would  mean  that  the  sequence 

(2)  %  copy  jack  sprat 

(3)  %  delete  foo 

is  either  part  of  a  RenameByCopy  or  of  a  Modify,  described  as  follows. 

End(*I2)  a 

(  (RenameByCopy(*I2)  a  C2-sl(*I2)  a  C3=s2(*I2)  a 
.  X  old(*I2)  -  jack  a  old(*I2)  *  foo ) 
v 

(Modify (*I2)  a  C2-sl(*I2)  a  C3-s3(*I2)  a 
file(s3(*I2))  -  foo  a 
new(sl(*I2))  -  sprat  a 
X  file(s3(*I2))  -  new(sl(*I2))) ) 

) 
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But  both  disjuncts  are  impossible,  since  the  files  which  appear  as  roles  of  each  event  do 
not  match  up  (as  marked  with  X's).  Therefore,  if  the  minimality  default  holds,  it  must 
be  the  case  that 


*I1-*I3 

This  means  that  the  observations 

(1)  %  copy  foo  bar 

(3)  %  delete  foo 

should  be  grouped  together,  as  part  of  a  Rename  or  Modify.  This  assumption  leads  the 
system  to  conclude  the  disjunction: 

End(*Il)  a 

(  (RenameByCopy(*I  1)  a  Cl»sl(*Il)  a  C3=s2(*Il)  a 
old(*H)  -  foo  a  new(*Il)  «  bar ) 
v 

(Modify  (*I1)  a  Cl-sl(*Il)  a  C3«s3(*I1)  a 
file(s3(*Il))  -  foo  a 
new(sl(*Il)) »  bar  a 
X  file(s3(*Il))  -  new(sl(*Il))) ) 

) 

The  second  alternative  is  ruled  out,  since  the  actions  cannot  be  part  of  the  same  Modify. 
The  system  concludes  observations  (1)  and  (3)  make  up  a  RenameByCopy  act,  and 
observation  (2)  is  part  of  some  unrelated  End  action. 

End(*Il)  a 

RenameByCopy(*Il)  a 
old(*Il)  -  foo  a  new(*Il)  *  bar  a 
End(*I2)  a 

(  (RenameByCopy(*I2)  a  C2=sl(*I2) ) 
v 

(Modify  (*I2)  a  C2-sl(*I2) ) 

) 

Further  checking  of  constraints  may  be  performed,  but  no  inconsistency  will  arise.  At 
this  point  the  plan  recognizer  may  trigger  the  "advice  giver"  to  tell  the  user 
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***  You  can  rename  a  file  by  typing 
***  %  move  oldname  newname 

6.4.  Medical  Diagnosis 

As  discussed  in  Section  1.4,  there  are  close  links  between  the  kinds  of  reason¬ 
ing  involved  in  plan  recognition,  and  that  employed  in  medical  diagnosis.  The  follow¬ 
ing  example  is  drawn  from  [Pople  82],  and  was  handled  by  CADUCEUS,  an  expert 
system  that  deals  with  internal  medicine.  We  have  made  a  number  of  simplifications, 
but  the  key  points  of  the  example  remain.  These  include  the  need  to  consider  special¬ 
izations  of  an  pathological  state  (abstract  event  type)  in  order  to  explain  a  symptom, 
finding,  or  state,  and  the  process  of  combining  or  unifying  the  tasks  invoked  by  each 
finding.  This  combination  process  corresponds  to  the  minimization  of  End  events  in 
the  plan  recognition  framework. 

6.4.1.  Representation 

The  figure  below  illustrates  a  small  part  of  CADUCEUS’s  knowledge  base. 
The  thin  "component"  arcs  are  here  understood  as  meaning  "can  cause".  We've  added 
the  type  End  as  an  abstraction  of  all  pathological  states  which  are  not  caused  by  other 
pathological  states.  The  basic  specializations  of  End  are  called  specific  disease  entities. 
We've  simplified  the  hierarchy  by  making  the  specializations  of  anemia  and  shock 
specific  disease  entities;  in  the  actual  knowledge  base,  anemia  and  shock  are  caused  by 
other  conditions. 
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figure  63:  Medical  Hierarchy 

The  logical  encoding  of  this  network  is  as  expected.  The  symptoms  caused  by 
a  disease  appear  in  the  decomposition  axiom  for  that  disease.  This  is  a  considerable 
simplification  over  the  original  CADUCEUS  model,  in  which  the  causal  connections 
need  only  be  probable.  Symptoms  of  high  probability,  however,  are  taken  by  CA¬ 
DUCEUS  as  necessary  manifestations,  and  CADUCEUS  will  rule  out  a  disease  on  the 
basis  of  the  absence  of  such  symptoms.  The  constraints  that  appear  at  the  end  of  a  de¬ 
composition  axiom  would  include  conditions  of  the  patient  that  are  (very  nearly)  neces¬ 
sary  for  the  occurrence  of  the  disease,  but  are  not  themselves  pathological.  These  could 
include  the  age  and  weight  of  the  patient,  his  immunization  history,  and  so  on.  Thus  a 
constraint  on  Alzheimer's  disease  would  include  the  fact  that  the  patient  is  over  40. 

A  few  of  the  axioms  and  assumptions  follow.  All  kinds  of  hyperbilirubin¬ 
emia  cause  Jaundice  ,  and  both  anemia  and  shock  cause  pallor  . 

V  y  .  hyperbilinibinemia(y)  o  jaundice(j(y)) 

V  y  .  anemia(y)  z>  pallor(p(y)) 
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V  y  .  shock(y)  3  pallor(p(y)) 

Unconjugated  hyperbilirubinemia  is  a  kind  of  hyperbilirubinemia,  which 
can  be  caused  by  hemolytic  anemia,  a  kind  of  anemia. 

V  x .  unconjugated-hyperbilirubinemia(x)  3  hyperbilirubinemia(x) 

V  y  .  hemolytic-anemia(y)  3  hyperbilirubinemia(h(y)) 

V  x .  hemolytic-anemia(x)  3  anemia(x) 

It  may  seem  a  bit  odd  that  we  need  to  use  a  first-order  language,  when  the 
problem  would  seem  to  be  expressible  in  purely  propositional  terms.  The  problem  with 
using  propositional  logic  arises  from  the  abstract  pathological  states.  A  realistic  medical 
knowledge  base  incorporates  several  methods  of  classifying  diseases,  leading  to  a 
complex  and  intertwined  abstraction  hierarchy.  It  is  very  likely  that  any  patient  will 
manifest  at  least  two  distinct  pathological  states  (perhaps  causally  related)  that  specialize 
the  same  state.  In  a  purely  propositional  system  such  a  pair  would  appear  to  be  com¬ 
petitors,  as  in  a  differential  diagnosis  set  But  this  would  plainly  be  incorrect  if  the 
two  stales  were  causally  related. 

6.4.2.  Assumptions 

Now  we  restrict  our  attention  to  the  covering  models  of  this  hierarchy.  The  ex¬ 
haustiveness  assumptions  include  the  fact  that  every  case  of  hyperbilirubinemia  is 
either  conjugated  or  unconjugated. 

V  x  .  unconjugated-hyperbilirubinemia(x)  3 

conjugated-hyperbilirubinemia(x)  v 
unconjugated-hyperbilirubinemia(x) 

Disjointedness  assumptions  include  the  fact  that  the  pathological  states  of  anemia, 
shock,  and  hepatobilary  involvement  are  distinct  It  is  important  to  note  that  this 
does  not  mean  that  the  states  cannot  occur  simultaneously;  rather,  that  none  of  these 
states  abstract  each  other. 

V  x  .  -ianemia(x)  v  -tshock(x) 
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V  x .  -ianemia(x)  v  -ihepatobilary-involvement(x) 

V  x .  -ishock(x)  v  -ihepatobilary-involvement(x) 

Finally,  the  component/use  assumptions,  better  called  the  manifestation/ cause  assump¬ 
tions,  allow  one  to  conclude  the  disjunction  of  causes  of  a  pathological  state,  thus  cre¬ 
ating  a  differential  diagnosis  set  An  important  special  case  occurs  when  there  is  only 
one  cause,  usually  at  a  fairly  high  level  of  abstraction,  for  a  state.  An  example  of  this  is 
the  association  of  jaundice  with  hyperbilirubinemia.  (Pop le  calls  this  case  a  con¬ 
strictor  relationship  between  the  manifestation  and  cause,  and  argues  that  such  cases 
play  a  critical  role  in  reducing  search  in  diagnostic  problem  solving.)  A  less  conclusive 
assumption  says  that  pallor  indicates  anemia  or  shock. 

V  x  .  jaundice(x)  3  3  y  .  hyperbilirubinemia(y)  a  x=j(y) 

V  x  .  pallor(x)  3 

(3  y  .  anemia(y)  a  x*p(y))  v 
(3  y  .  shock(y)  a  x-p(y)) 

6.4.3.  The  Problem 

Well  sketch  the  kind  of  reasoning  that  goes  on  when  the  diagnostician  is  con¬ 
fronted  with  two  symptoms,  jaundice  and  pallor.  From  jaundice  the  diagnostician 
concludes  hyperbilirubinemia.  This  leads  (by  exhaustion)  to  either  the  conjugated 
or  unconjugated  varieties.  Now  CADUCEUS  (and  perhaps  a  human  physician?) 
may  try  to  perform  tests  to  decide  between  these  alternatives  at  this  point  The  frame¬ 
work  we've  developed,  however,  allows  us  continue  inference  in  each  alternative.  The 
first  leads  to  hemolyic  anemia  and  then  anemia;  the  second  to  three  different  kinds 
of  hepatobilary  involvement.  The  following  graph  shows  the  final  conclusion. 
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figure  6.4:  Conclusions  from  Jaundice 

The  graph  represents  the  following  logical  sentence.  (Further  steps  in  this  example 
with  be  illustrated  only  in  the  graphical  form.) 

jaundice(*Jl)  a  hyperbilirubinemia(*Hl)  a 


( 

( 

unconjugated-hyperbilimbinemia(*H  1 )  a 
hemolytic-anemia(*El)  a 

) 

anemia(*El) 

V 

( 

conjugated-hyperbilirubinemia(*Hl)  a 
(  Gilberts-disease(*El)  v 

bilary-tract(*El)  v 
heptoceLlular-involvement(*E  1 ) 

) 

)  A 

hepatobilary-involvement(*E  1 ) 

)  A 

End(*El) 
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Next  the  diagnostician  considers  pallor.  This  leads  to  a  simple  disjunction. 
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figure  6-5:  Conclusions  from  Pallor 

Finally,  the  diagnostician  applies  Occam's  razor,  by  making  the  assumption  that  the 
symptoms  are  caused  by  the  same  disease.  This  corresponds  to  equating  the  specific 
disease  entities  at  the  highest  level  of  abstraction  (End)  in  each  of  the  previous  conclu¬ 
sions.  In  other  words,  we  apply  the  strongest  minimum  cardinality  default.  This  al¬ 
lows  the  diagnostician  to  conclude  that  the  patient  is  suffering  from  hemolytic  ane¬ 
mia,  which  has  led  to  unconjugated  hyperbilirubinemia. 
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figure  6.6:  Conclusions  from  Jaundice  and  Pallor 

A  practical  medical  expert  system  must  deal  with  a  great  many  probleirs  not  il¬ 
lustrated  by  this  simple  example.  We  have  not  dealt  with  the  whole  problem  of  gener¬ 
ating  appropriate  tests  for  information;  we  cannot  assume  that  the  expert  system  is  pas- 
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sively  soaking  in  information,  like  a  person  reading  a  book.  The  full  CADUCEUS 
knowledge  base  is  enormous,  and  it  is  not  clear  whether  the  complete  algorithms  dis¬ 
cussed  in  the  next  chapter  could  deal  with  it  It  does  seem  plain,  however,  that  our 
framework  for  plan  recognition  does  formalize  some  key  parts  of  diagnostic  reasoning. 
The  rich  and  highly  structured  knowledge  base  required  for  medical  diagnosis  makes  it 
a  very  good  domain  for  exercising  any  formal  theory  of  non-deductive  inference. 
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Chapter  7 

Algorithms  for  Plan  Recognition 

7.1.  Directing  Inference 

Two  related  problems  arise  in  implementing  our  theory  of  plan  recognition:  in¬ 
ference  must  be  directed  toward  some  particular  goal,  and  must  be  limited  in  some 
manner  to  insure  that  our  programs  don't  run  forever.  The  pattern  of  inference  appar¬ 
ent  in  the  previous  examples  suggests  some  answers:  from  each  observation,  apply 
Component/Use  assumptions  until  an  instance  of  type  End  is  reached.  That  is,  create  a 
proof  that  some  instance  of  End  occurs.  Reduce  the  number  of  alternatives  (or  cases  in 
the  proof)  by  checking  constraints  locally.  In  order  to  combine  information  from  two 
observations,  equate  the  instances  of  End  inferred  from  each  and  propagate  the  equali¬ 
ty,  further  reducing  disjunctions.  If  all  alternatives  are  eliminated,  then  conclude  that 
the  observations  belong  to  distinct  End  events.  Multiple  simultaneous  End  events  can 
be  recognized  by  considering  all  ways  of  grouping  the  observations. 

These  operations  suggest  a  graph  based  implementation,  rather  than  one  which 
stores  sentences  in  clausal  form.  A  graph  is  built  bottom-up,  from  a  node  which  repre¬ 
sents  an  observed  event,  to  a  node  which  represents  an  End  event  When  we  need  to 
infer  up  the  abstraction  hierarchy,  we  create  a  new  node  of  the  more  abstract  kind,  and 
record  the  the  more  specialized  node  as  an  alternative  for  the  abstract  node.  If  this  is 
done  properly,  the  result  can  be  viewed  as  an  and/or  graph,  rooted  at  the  End  node. 
The  alternative  (OR)  arcs  are  marked  with  an  equality  sign  (*),  and  the  component 
(AND)  arcs  are  labeled  with  the  role  function  name.  The  figure  below  shows  the  graph 
that  is  generated  an  observation  of  MakeMannara. 
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figure  7.1:  E-Graph  for  MakeMarinara 

These  structures  are  called  explanation  graphs,  or  e-graphs.  The  graph  has  two  com¬ 
plementary  interpretations.  It  can  be  viewed  as  a  proof  that  an  instance  of  End  occurs, 
given  instances  of  the  events  in  its  leaves.  It  can  also  be  interpreted  as  a  sentence  of 
FOL,  which  states  that  an  instance  of  End  occurs,  that  this  instance  is  equal  to  one  of  a 
number  of  other  tokens  of  more  specific  type,  and  so  on.  The  graph  above  can  be  in¬ 
terpreted  as  the  sentence 

3  n7,n<j,n5,n4,n3,n2,ni  .  End(n7)  a  n7»ng  a  PrepareMealCng)  a 
(  (  ng»n4  a  MakePastaDish(n4)  a 

n4*n2  a  MakeSpaghettiMarinara(n2)  a 
step2(n2)-ni  a  MakeMarinara(n  \ ) ) 
v  ( ng*n5  a  MakeMeatDish(n5)  a 

n5*n3  a  MakeChickenMarinara(n3 )  a 
step5(n3)»ni  a  MakeMarinara( n \ ) ) ) 

Section  7.2.3  below  states  the  rules  for  translating  an  explanation  graph  into  an  FOL 
sentence.  The  algorithms  described  in  this  chapter  provide  the  specialized  and  deter- 
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ministic  proof  rules  for  constructing  explanation  graphs,  given  some  initial  descriptions 
of  event  instances. 

Each  e-graph  describes  one  End  event  The  occurrence  of  several  End  events  is 
represented  by  a  conjunctively-joined  set  of  e-graphs,  called  an  hypothesis.  To  account 
for  all  possible  ways  of  grouping  the  observed  events,  it  may  be  necessary  to  consider 
a  disjunctively  joined  set  of  hypotheses.  The  interpretation  of  such  a  set  of  hypotheses 
is  a  sentence  true  in  all  minimum  covering  models  of  the  observations. 

7.2.  Explanation  Graphs 

7.2.1.  Basic  Elements  of  an  E-Graph 

An  e-graph  is  made  up  of  various  relations  over  a  set  of  values.  A  value  can  be 
a  rigid  designator,  a  fuzzy  temporal  constraint,  or  a  node. 

•  Rigid  designators  are  unique  names  for  objects;  distinct  rigid  designators  refer  to  dis¬ 
tinct  objects.  It  is  a  convenient  philosophical  fiction  that  proper  names  like  "John 
Smith"  are  rigid  designators.  The  rigid  designators  can  be  identified  with  a  distin¬ 
guished  subset  of  FOL  constants. 

•  A  time  interval  can  be  thought  of  as  a  pair  of  real  numbers,  representing  its  beginning 
and  ending  instance  according  to  some  absolute  scale.  Yet  we  can  rarely  specify  a 
particular  time  interval  in  so  precise  a  fashion.  Instead,  we  write  expressions  which 
place  some  relative  constraint  on  the  interval.  Various  schemes  have  been  proposed  for 
writing  down  these  constraints,  such  as  Allen's  interval  operators,  which  have  been 
used  in  this  thesis  up  to  this  point  A  special  but  very  useful  kind  of  constraint  can  be 
represented  by  a  list  of  four  numbers.  Where  I  is  a  time  interval,  and  start-min,  start- 
max,  end-min,  and  end-max  are  numbers  on  some  absolute  temporal  scale, 

I  €  (start-min,  start-max,  end-min,  end-max) 

is  true  when  the  first  instance  of  I  falls  between  start-min  and  start-max  inclusive,  and 
the  final  instance  of  I  falls  between  end-min  and  end-max  inclusion.  A  large  fragment 
of  Allen's  interval  logic  can  be  translated  into  an  algebra  over  these  "fuzzy"  temporal 
constraints.  Appendix  D  shows  how  this  may  be  done.  The  main  advantage  of  this 
approach  is  that  intervals  are  all  related  to  a  single  universal  scale,  so  we  do  not  need  to 
maintain  a  table  relating  every  interval  to  every  other  interval  (as  in  [Allen  1983b]). 
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Furthermore,  disjunctions  in  the  e-graph  can  lead  to  disjunctive  temporal  constraints 
between  different  sets  of  intervals,  and  such  disjunctive  constraints  cannot  be  handled 
by  a  straightforward  implementation  of  Allen's  logic. 

•  A  node  represents  any  object  which  is  not  given  a  rigid  designator,  and  in  particular, 
all  event  instances.  Nodes  are  a  distinguished  subset  of  the  constants  of  FOL.  Every 
e-graph  includes  a  function  type  which  maps  every  node  to  a  unique  unary  predicate. 
In  particular,  every  event  node  is  mapped  to  a  member  of  of  Hg.  An  e-graph  has  ex¬ 
actly  one  node  of  type  End. 

7.2.2.  Roles  of  Event  Types  and  of  Nodes 

The  formulation  of  events  described  in  Chapter  2  treats  the  roles  of  an  event  in¬ 
stance  as  the  unary  functions  which  apply  to  the  instance.  Because  an  event  token  may 
be  of  several  types,  it  is  useful  to  define  the  subset  of  roles  of  an  instance  which  are 
relevant  to  each  particular  type  of  the  token.  For  example,  a  token  :C  may  be  of  types 
Change-Location  and  Ride-Train.  The  roles  of  :C  include  destination  and  train-num¬ 
ber.  While  both  roles  are  relevant  when  one  thinks  of  :C  as  a  Ride-Train,  only  the 
former  makes  sense  when  one  thinks  of  :C  as  a  Change-Location. 

The  association  of  roles  with  types  is  an  important  principle  for  organizing  the 
construction  of  an  e-graph.  Events  are  abstracted  in  order  to  collapse  the  search  space. 
Some  of  the  roles  of  the  more  specialized  types  of  the  event  are  ignored  in  the  process. 
Unless  one  "abstracts  away"  the  roles  in  this  manner,  it  is  not  possible  to  have  different 
alternatives  for  a  single  abstract  event  Consider  the  example  above.  The  step2  role  of 
N2  is  filled  by  Nj.  When  N2  is  viewed  abstractly  as  N$,  the  step2  role  is  eliminated. 
This  allows  N$  to  also  abstract  N3,  which  uses  N 1  to  fill  its  step5  role.  The  exact  re¬ 
lationship  between  Ng  and  N  j  is  left  open. 

Not  all  roles  of  an  event  are  components,  of  course.  Others  include  the  agent  of 
the  event,  its  time,  the  objects  involved,  and  so  on.  Any  of  these  roles  are  also  candi¬ 
dates  for  being  abstracted  away. 

Definitions.  The  roles  of  an  event  type  E  are  the  unary  functions  which  appear 
in  the  decomposition  axiom  for  any  type  which  abstracts'"  E,  and  apply  to  the  variable 
which  appears  in  the  main  antecedent  of  the  axiom.  The  component  roles  of  an 
event  type  are  the  roles  of  the  event  type  which  yield  an  event  token  when  applied  to 
an  event  token.  The  parameters  of  an  event  type  are  all  the  roles  of  the  event  type 
which  are  not  component  roles. 
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In  an  explanation  graph,  every  node  corresponding  to  an  event  token  has  a 
unique  associated  type.  Therefore  we  can  talk  of  the  pairs  of  role  functions  and  role 
values  of  a  node  as  follows. 

Definitions.  A  roleval  of  a  node  N  is  a  pair  (fr,v)  such  that 

1.  fr  is  a  role  of  type(N). 

2.  v  is  a  value,  as  defined  above. 

3.  v«fr(N). 

The  known  rolevals  of  N  are  simply  the  rolevals  of  N  for  which  we  can  compute 
the  value  v.  Sometimes  we  speak  of  a  roleval  without  reference  to  a  particular  node, 
meaning  simply  a  (fr,v)  pair.  The  component  and  parameter  rolevals  of  N  are 
defined  in  the  obvious  way. 

7.2.3.  Definition  of  an  Explanation  Graph 

An  explanation  graph  G  »  (Vr,  Vt,  Vn,  Type,  R,  Rolevals,  Alt),  where: 

Vr  is  the  set  of  rigid  designators. 

Vt  is  the  set  of  fuzzy  temporal  constraints. 

Vn  is  the  set  of  nodes. 

Type  is  a  function  from  nodes  to  event  types. 

Type:  Vn  =>  He 

R  is  the  set  of  roles. 

Rolevals  is  the  known  roleval  relation,  over  node,  roles,  and  values. 

Rolevals  c  Vn  x  R  x  {Vr  uVt  u  Vn} 

Alt  is  the  altematives-for  function,  from  nodes  to  sets  of  nodes. 

Alt :  Vn  =>  2Vn 

An  interpretation  of  a  node  n  in  G  is  the  following  sentence: 
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Interpretation(n)  * 

{Ei(n)  |  typc(n)«Ej}  a 

A{v»fr(n)  a  S  |  (n,fr,v)  6  Rolevals  a  ve  Vn  a 
S  » Interpretation  v)}  a 

A{v-f f(n)  |  (n,fr,v)  e  Rolevals  a  ve  Vr  }  a 

A{fj(n)  s  v  j  (n/r,v)  6  Rolevals  a  ve  Vt  }  a 

V{n»m  a  S  j  m  s  Alt(n)  aS*  Interpretation(m)} 

The  sentence  states  that  n  is  of  the  specified  type;  that  it  has  the  specified  values  as 
roles;  and  that  it  must  be  equal  to  one  of  a  set  of  other  nodes  .The  interpretation  of  an  e- 
graph  is  the  interpretation  of  its  end  node,  with  nodes  replaced  by  existentially-quanti¬ 
fied  variables. 

Interpretation^) » 

3  xi, ...  xfc  .  Ihterpretation(h)  9 

where 

h  «  Vn  a  type(h)«End 
0  -  (ni/xi, ...  nfc/xfc) 

(ni, ...  nfc}  ■  Vn 

It  is  important  to  understand  that  an  e-graph,  unlike  a  lexical  hierarchy,  has  ex¬ 
istential  import  It  is  used  to  represent  the  fact  that  some  particular  tokens  are  of  some 
types,  and  form  components  of  other  tokens.  A  lexical  hierarchy  has  no  existential  im¬ 
port  It  only  asserts  that  if  there  are  any  tokens  of  some  type,  then  they  all  must  have 
certain  properties. 

The  treatment  of  the  alternative  relation  distinguishes  e-graphs  from  previous 
work  that  uses  "semantic  nets"  to  represent  sentences.  Systems  such  as  KL-1 
[Brachman  85]  can  only  represent  conjunctions  of  atomic  assertions.  There  is  no  way 
to  assert  that  a  particular  entity  is  either  of  type  Ej  or  of  type  E2.  More  recent  work  on 
semantic  nets,  such  as  KRYPTON  [Brachman,  Fikes,  &  Levesque  85],  has  gained  the 
ability  to  make  disjunctive  assertions  by  adding  a  non-network  component  which  holds 
sentences  containing  arbitrary  disjunctions  and  conjunctions.  Such  systems  use  a  gen¬ 
eral-purpose  theorem  prover  to  make  inferences  from  these  assertions.  General  theo¬ 
rem  proving  is  both  theoretically  and  practically  intractable,  and  little  is  known  about 
controlling  and  focusing  their  power.  The  special  form  of  an  e-graph,  however,  lets  us 
closely  control  specific  kinds  of  reasoning  with  disjunctions. 
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7.3.  Computing  the  Uses  of  an  Event  T ype 

As  we  have  hinted  above,  the  first  stage  of  the  recognition  algorithm  is  to  prove 
that  an  observed  event  token  is  a  component  of  an  End  event,  by  considering  all  the 
possible  uses  of  the  observed  event  The  Uses  relation  is  roughly  -  but  only  roughly  - 
the  inverse  of  die  direct  component  relation.  Starting  from  an  event  of  a  particular  type 
-  say,  GetN ewspaper(C  \ )  -  we  consider  all  the  uses  of  Cj  as  a  GetNewspaper  -  such 
as  a  step  of  Becomelnformed.  Then  we  consider  the  uses  of  C\  for  types  which  ab¬ 
stract  and  specialize  GetNewspaper.  So  if  GetPaper  abstracts  GetNewspaper,  we  then 
consider  the  uses  of  Cj  as  a  GetPaper,  which  differ  from  the  uses  already  considered  - 
such  as  a  step  of  BuildFire.  (As  will  be  seen  below,  we'd  actually  use  different  nodes 
for  each  abstraction  and  specialization  of  Ci-)  Finally,  it  may  be  necessary  to  consider 
uses  of  types  which  specialize  the  type  of  the  original  observation.  To  allow  for  the 
possibility  that  Ci  could  be  an  instance  of  GetSleazyTabloid,  we  consider  the  use  of  the 
observed  event  as  a  step  of  Enjoy  VicariousTitillation. 

7.3.1.  Use  Abstraction  and  Specialization 

Just  as  one  can  talk  of  one  type  abstracting  another,  one  can  talk  of  abstraction 
of  the  component  relationship  between  types.  In  the  cooking  world,  the  relationship 
between  MakeSauce  and  MakePastaDish  abstracts  the  relationship  between  MakeMari- 
nara  and  MakeSpaghettiMarinara.  In  order  to  explain  an  instance  of  MakeMarinara, 
one  considers  MakeSpaghettiMarinara  and  MakeChickenMarinara.  It  is  not  necessary 
to  consider  the  path  from  MakeMarinara  to  MakeSauce  to  MakePastaDish,  because  that 
way  of  using  the  instance  abstracts  a  more  specific  use  of  the  instance. 

A  use  is  a  triple,  (Eg,  fr,  Eu),  and  stands  for  the  possibility  that  (some  instance) 
of  Eq  could  fill  the  fr  role  of  some  instance  of  Eu.  For  example,  when  trying  to  explain 
event  instance  C},  this  use  would  mean 

3  y  •  Eu(y)  a  Ci*fr(y)  a  Ec(Ci) 

The  abstraction  relation  between  uses  is  defined  as  follows. 

Definition.  (Eac,  fr?  Eau)  abstracts  (Ec,  fr,  Eu)  exactly  when  Eac  abstracts  Ec,  and 
Eau  abstracts  Eu.  When  is  either  the  same  as  or  abstracts  U2,  we  say  U 1  abstracts* 
U2-  Specializes  and  specializes*  are  the  inverse  of  abstracts  and  abstracts*  respective¬ 
ly- 
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7.3.2.  The  Uses  and  Direct  Component  Relations 

It  is  sometimes  necessary  to  consider  uses  of  a  type  of  which  that  type  is  not  a 
direct  component  Consider  the  following  extension  to  the  cooking  hierarchy. 


Make 

Pasta 


figure  72:  Extended  Cooking  Hierarchy  Detail 

To  explain  an  instance  of  MakeSpaghetti,  one  considers  the  use  (MakeSpaghetti,  sj, 
MakeSpaghettiMarinara).  The  use  (MakeNoodles,  sj,  MakePastaDish)  is  not  consid¬ 
ered,  because  it  abstracts  the  former  use.  But  then  a  possibility  is  unfairly  ignored!  An 
event  of  type  MakeSpaghetti  could  also  be  a  component  of  an  event  of  type  Make- 
NoodlesPrimavera,  because  that  type  does  not  further  specify  the  type  of  its  si  role. 
Therefore  the  set  of  Uses  to  consider  should  include  (MakeSpaghetti,  si,  Make- 
NoodlesPrimavera). 

The  next  section  precisely  specifies  how  Uses  augments  the  set  of  direct  com¬ 
ponents,  in  order  to  guarantee  that  the  recognition  algorithms  are  correct  The  set  Uses 
needs  only  be  computed  once  and  for  all,  before  any  observations  are  input  Therefore 
we  need  not  be  concerned  with  finding  an  efficient  algorithm  to  compute  Uses;  practi¬ 
cally  any  exhaustive,  off-line  method  is  sufficient 


7.3.3.  Definition  of  Uses 
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7.3.4.  Example 

Following  are  the  uses  calculated  at  each  stage  for  MakeNoodles  and  Make- 
Spaghetti,  using  the  extended  hierarchy  above. 

U0  -  {  (MakeNoodles,  si,  MakePastaDish) 

(MakeNoodles,  si,  MakeFettuciniAlfredo) 

(MakeNoodles,  si,  MakeSpaghetdMarinara) 

(MakeNoodles,  si,  MakeNoodlesPriinavera) 

(MakeS  paghetti,  si,  MakeSpaghetdMarinara) 

(MakeSpaghetti,  si,  MakePastaDish) 

(MakeS  paghetti,  si,  MakeFettuciniAlfredo) 

(MakeSpaghetti,  si,  MakeNoodlesPriinavera) } . 

Next,  eliminate  the  impossible  uses. 

Up  »  {  (MakeNoodles,  si,  MakePastaDish) 

(MakeNoodles,  si,  MakeFettuciniAlfredo) 

(MakeNoodles,  si,  MakeSpaghetdMarinara) 

(MakeNoodles,  si,  MakeNoodlesPriinavera) 

(MakeSpaghetti,  si,  MakeSpaghetdMarinara) 

(MakeSpaghetti,  si,  MakePastaDish) 

(MakeSpaghetti,  si,  MakeNoodlesPriinavera) } 

Throw  out  uses  where  some  specializations  of  the  user  type  are  not  uses. 

Ux  »  { (MakeNoodles,  si,  MakePastaDish) 

(MakeNoodles,  si,  MakeFettuciniAlfredo) 

(MakeNoodles,  si,  MakeSpaghetdMarinara) 

(MakeNoodles,  si,  MakeNoodlesPriinavera) 

(MakeSpaghetti,  si,  MakeSpaghetdMarinara) 

(MakeSpaghetti,  si,  MakeNoodlesPriinavera) } 

Simplify  this  set  by  eliminating  sets  of  uses  which  exhaustively  specialize  another  use. 

Uses  *  {  (MakeNoodles,  si,  MakePastaDish) 

(MakeSpaghetd,  si,  MakeSpaghetdMarinara) 

(MakeSpaghetti,  si,  MakeNoodlesPriinavera) } 

Thus  from  MakeNoodles  one  should  consider  the  use  MakePastaDish,  while  two 
search  paths  start  from  MakeSpaghetd,  one  through  MakeSpaghetdMarinara  and  the 
other  through  MakeNoodlesPriinavera. 


7.4.  Constraint  Checking 
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Given  a  component  event  type,  Uses  yields  a  very  minimal  set  of  use  types 
which  describes  those  types  whose  instances  could  have  a  component  of  the  given 
type,  but  do  not  require  the  component  to  be  of  a  particular  subtype  of  the  given  type. 

A  set  of  types  describes  another  set  of  types  if  every  member  of  the  latter  set 
specializes*  or  abstracts*  a  member  of  the  former  set  A  set  of  types  is  minimal  if  no 
member  abstracts  another  member,  and  is  very  minimal  if  it  also  does  not  contain  a 
subset  {Ei, ...,  Eq}  which  forms  an  exhaustive  set  of  specializations  for  some  type 
Eq.  (A  very  minimal  set  should  contain  Eo  instead.)  We  define  Uses  in  four  steps,  as 
follows. 

U<x  *  {  (Eo  *r>  Eu)  I  3  E^u»  Eac»  E^u  • 

Ec  is  the  fr  direct  component  of  E^u  a 

Eac  abstracts*  Ec  a 

Eac  is  the  fr  direct  component  of  Eau  a 
Eau  abstracts*  Eu  } 

Up  *  {(Ec,  fp  Eu)  |  (Ec,ff,Eu)  6  Ua  a 

Hucl(H)  j-Vx  .Eu(x)3  -,Ec(fr(x))}  . 

Uz  *  {(Ec,  fp  Eu)  |  (Ec^r»Eu)  €  Up  a 

^  E$u  .  Eu  abstracts  E$a  "D  (Ec,  fp  E$a)  €  Up  } 

Uses  *  {(Ec,  fp  Ea)  I  (Ec,fpEu)  €  U^  a 

->3  Eaa .  Ejm  abstracts  Ea  a  (Ec,  fp  Eaa)  €  U^  } 

This  rather  complicated  construction  can  be  understood  as  follows.  We  start  by  con¬ 
sidering  Eq  which  have  some  explicit  direct  uses.  Then  we  add  uses  from  Ec  to  all  the 
types  which  directly  use  (in  the  same  role)  abstractions  of  Ec-  Next  we  add  uses  from 
Ec  to  all  types  which  specialize  any  of  the  previous  uses.  (This  is  to  catch  uses  like 
(MakeSpaghetti,  sj,  MakeNoodlesPrimavera)  above.)  This  large  set  is  Ua.  Next  we 
throw  out  those  uses  which  are  not  in  fact  possible,  but  arose  in  the  specialization  step. 
This  yields  Up.  The  next  two  steps  redescribe  Up  in  simpler  terms.  In  Ux  we  throw 
out  all  the  uses  where  the  user  type  has  some  specializations  which  cannot  have  com¬ 
ponents  of  type  Ec-  Then  in  Uses  we  eliminate  a  set  of  uses  if  they  exhaustively  spe¬ 
cialize  a  use  already  under  consideration.  This  corresponds  to  making  uses  very  mini¬ 
mal,  as  described  above. 
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The  algorithms  need  to  check  the  various  constraints  which  appear  in  the  de¬ 
composition  axioms  for  the  event  types.  It  is  never  necessary  to  prove  that  a  constraint 
actually  holds;  rather,  one  should /ail  to  prove  that  the  constraint  is  false.  As  a  side  ef¬ 
fect  of  constraint  checking,  the  values  of  parameters  are  propagated  to  a  node  from  its 
components.  In  Section  2.6.3,  constraints  were  categorized  as  equality  constraints, 
temporal  constraints,  and  preconditions  and  effects;  the  latter  two  jointly  called  fact 
constraints.  Each  kind  of  constraint  is  handled  in  a  different  manner. 

7.4.1.  Equality  Constraints 

An  equality  constraint  usually  equates  a  parameter  of  a  node  with  a  parameter  of  one  of 
its  components.  An  equality  constraint  fails  if  the  values  of  the  two  items  are  distinct 
rigid  designators.  As  a  side  effect,  when  a  value  is  known  for  the  parameter  of  the 
component,  but  none  for  the  parameter  of  the  node,  the  value  is  assigned  to  the  param¬ 
eter  of  the  node. 

7.4.2.  Temporal  Constraints 

Rather  than  passing  around  precise  values  for  time  parameters,  the  implementation 
passes  fuzzy  temporal  constraints.  All  unknown  times  are  implicitly  constrained  by 


(— oo  +oo  — OO  +oo) 

A  temporal  constraint  fails  if  a  time  parameter  is  assigned  an  empty  fuzzy  constraint, 
such  as 


(5  6  2  3) 

There  can  be  no  time  interval  which  starts  between  times  5  and  6,  and  ends  between 
times  2  and  3.  (Time  doesn't  run  backwards!)  Corresponding  to  each  binary  temporal 
relation,  such  as  Before  or  During,  there  an  algebraic  operation  on  fuzzy  constraints, 
with  the  following  property: 

Where  Tj,  T2  are  time  intervals,  and  Z^2  are  fuzzy  constraints: 

Tj  6  Z\  a  Tj  e  Z2  a  Tj  binary-op  T2  o 

Ti  €  Ftransitive(Zi,  binary-op,  Z2) 

This  algebra  is  described  in  Appendix  D.  These  rules  are  used  to  update  the  fuzzy  con¬ 
straints  assigned  to  the  time  parameters  of  the  node  under  consideration.  The  appendix 
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also  defines  the  functions  Fintersection  and  empty,  which  are  used  in  the  algo¬ 
rithms. 


7.4.3.  Fact  Constraints 

Fact  constraints  are  checked  only  if  values  are  known  for  all  the  arguments  of  the 
predicates  in  die  constraint  A  limited  theorem  prover  attempts  to  prove  the  negation  of 
the  fact  The  constraint  fails  if  the  proof  succeeds,  hi  order  to  check  a  fact  of  the  form 

Never(T property) 

the  theorem  prover  attempts  to  show  that  there  exists  a  time  T2  over  which  the  property 
holds,  and  must  intersect  Tj,  in  order  to  satisfy  their  respective  fuzzy  temporal  con¬ 
straints. 

The  implementation  assumes  that  the  general  domain  axioms  Hq  are  such  as  to 
guarantee  finite  failure. 

7.5.  Overview  of  the  Algorithms 

The  algorithms  are  presented  in  a  structured  pseudo-code,  freely  mixing  English 
and  programming  constructs  in  order  to  maximize  clarity.  All  functional  parameters  are 
passed  by  value.  There  are  three  basic  algorithms:  explain-observation,  match- 
graphs,  and  group-observations.  We  present  an  overview  of  the  algorithms,  then 
the  pseudo-code,  and  finally  a  more  detailed  description  of  their  operation. 

7.5.1.  Explain 

The  function  call  explain-observation(etype,edescr)  builds  an  e-graph  on 
the  basis  of  a  single  observation  of  an  event  of  etype,  which  satisfies  the  role/value 
pain  in  edescr.  Two  venions  of  the  subroutine  which  checks  for  redundant  search 
paths  are  presented.  The  first  assumes  that  the  abstraction  hierarchy  forms  a  tree.  The 
more  general  venion  contains  a  slightly  more  complex  test  to  avoid  finding  redundant 
paths  from  a  node  to  End. 

Following  is  an  example  of  the  "multi-path  problem"  with  multiple  inheritance. 
Consider  the  following  fragment  of  an  event  hierarchy. 
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figure  73:  Multiple  Inheritance  Hierarchy 

Suppose  the  primary  event  to  be  explained  is  of  type  A.  Then  one  should  consider  the 
use  (C,  $i,  K)  but  not  the  more  general  use  (D,  sj,  L).  (Why?  Because  of  the  ex¬ 
haustiveness  assumption,  every  L  is  either  an  M  or  a  K.  M's  must  have  components  of 
type  N,  and  not  of  type  A.  Every  C  is  an  A.  Therefore,  from  C  it  is  safe  to  infer  K. 
The  weaker  conclusion  of  L  is  not  an  alternative  conclusion.)  On  the  other  hand,  if  the 
primary  node  is  of  type  B,  then  one-should  consider  (D,  sj,  L)  but  not  the  more  spe¬ 
cific  (C,  sj,  K).  (Why?  Because  the  B  could  be  an  N  or  an  A,  so  the  best  one  can  do 
is  conclude  L.)  The  potentially  expensive  test  in  the  more  general  version  of  redun¬ 
dant  "looks  back"  at  the  primary  node  to  see  whether  or  not  a  different  abstraction  path 
from  the  primary  node  could  reach  a  more  specific  use  than  the  one  at  the  current  node. 

An  important  feature  of  the  algorithm  is  the  test  involving  the  consider-spec 
flag.  In  order  to  explain  an  event  of  etypc,  one  must  consider  event  types  that  (1)  ab¬ 
stract  etype;  (2)  specialize  etype;  and  (3)  abstract  types  which  specialize  etypc.  We 
must  not  consider  types  that  specialize  types  that  abstract  etype  -  because  this  includes 
all  of  He- 

7.5.2.  Match 

The  function  call  match-graphs(graphl,graph2)  builds  a  new  graph  which 
equates  the  End  nodes  of  graph  1  and  graph2.  The  match  algorithm  assumes  that  there 
exists  either  no  or  exactly  one  type  which  is  the  "greatest  lower  bound"  of  any  pair  of 
elements  of  He-  Formally: 
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El  and  E2  are  compatible  d 

3  E3  s  He  .  Ej  abstracts*  E3  a  E2  abstracts*  E3  a 

VE4.E1  abstracts*  E4  a  Ei  abstracts*  E4  z> 

E3  abstracts*  E4 

A  technical  term  for  this  is  that  He  is  a  lower  semi-lattice.  The  gib  of  non-compatible 
types  is  by  definition  0. 

The  key  feature  of  match-graphs  is  the  use  of  a  cache  to  store  matches  between 
nodes.  Because  e-graphs  are  digraphs,  rather  than  trees,  it  is  possible  for  the  same  pair 
of  nodes  to  be  visited  more  than  once  in  the  matching  process.  The  node  created  the 
first  time  the  pair  is  visited  is  used  as  the  result  of  any  subsequent  match  of  the  pair. 
Without  this  feature,  match-graphs  would  multiply  the  input  digraphs  into  a  tree, 
possibly  causing  an  exponential  increase  in  the  size  of  the  result  over  the  size  of  the  in¬ 
puts. 


7.5.3.  Group 

The  function  group-observations  continually  inputs  observations  and 
groups  them  into  sets  to  be  accounted  for  by  particular  e-graphs.  The  function  mini- 
mum-Hypoths  is  called  to  retrieve  a  disjunctive  set  of  current  hypotheses,  each  of 
which  is  a  conjunctive  set  of  e-graphs  which  accounts  for  all  of  the  observations,  using 
as  few  End  events  as  possible. 

There  are  three  versions  of  this  algorithm.  The  Non-Dichronic  algorithm  re¬ 
turns  the  same  results  regardless  of  the  order  in  which  the  observations  are  input,  and 
computes  conclusions  mc-entailed  by  the  inputs.  The  Incremental  Minimization 
and  Sticky  versions  correspond  to  the  incremental  theories  of  recognition  discussed  in 
Chapter  5.  The  Incremental  Minimization  algorithm  only  increases  the  number  of 
different  End  events  under  consideration  when  it  has  to,  and  retains  all  previous  con¬ 
clusions.  It  implements  the  imc-entailment  operator.  The  Sticky  algorithm  tries  to 
explain  each  observation  by  making  it  part  of  the  most  recently  invoked  End  event 

7.6.  Pseudo-Code 

7.6.1.  Utility  Functions 


7A-D-  113 


Utility  Functions 

on  Nodes  of  Explanation  Graphs 


function  type(node) 

/*  most  specific  type  assigned  to  the  node  7 


function  roleval(node,  role) 

/*  the  value  of  role  (node)  7 


function  alternatives-for(node) 

P  a  set  {n1t  n^  ....  ny }  such  that 

node»n1  vnode*n2  v ...  v  node-nj  7 


function  status(node) 

r  initially  True  for  all  nodes,  set  to  False  when  the  alternative  con¬ 
taining  the  node  is  ruled  out  7 


1.2. 


Explain-Observation 


Explain-Observation 

global  egraph  /*  stores  the  explanation  graph  built  by  explain  7 

/*  explain-observation 

etype :  type  of  observed  event 

edescr :  role/value  pairs  of  observed  event 

returns 

egraph  :  explanation  graph  rooted  at  End-node-ot (graph) 
7 

function  explain-observation(etype,  edescr)  is 
initialize  egraph 

explain(etype,  edescr,  (etype),  True,  etype) 
return  egraph 
end  build-explanation-graph 
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r  explain 

etype :  type  of  event  to  be  explained 
edescr :  list  of  (role  value)  pairs  which  describe  the  event 
visited :  set  of  event  types  considered  so  far 
consider-spec :  if  true,  then  consider  specializations  of  etype 
primary :  the  type  at  which  search  to  explain  this  event 
instance  began 

returns 

node :  represents  event  token  of  etype 
pathToEndFound :  true  if  successfully  explained  event 
aJIVisited :  visited  augmented  with  types  considered  which 
abstract  etype 
7 

function  explain(etype,  edescr,  visited,  consider-spec,  primary)  Is 
explained  :■  False  /*  need  to  find  a  path  to  End  7 

visited  :»  visited  u  {etype} 
r  Merge  search  paths  if  possible  7 

if  egraph  has  a  node  n  of  etype  which  exactly  matches  edescr  then 
return  (n,  status(n),  visited)  endif 
/*  Cant  merge,  must  consider  a  new  node  7 
create  a  new  node  n  of  etype  which  satisfies  edescr  in  egraph 
if  etype-End  then  return  (n,  True,  visited)  endif 
propagate  equality,  temporal,  and  fact  constraints  at  n 
if  constraints  violated  then 
status(n)  :■  False 
return  (n,  False,  visited)  endif 
f*  Consider  direct  Uses  of  etype  7 
forall  (etype,  r,  utype)  e  Uses 

/*  eliminate  unnecessary  Uses  7 
If  -iredundant(etype,  r,  utype,  visited,  primary)  then 

(-.foundpath,-) explain(utype,  {(r,n)},  0,  True,  utype) 
r  the  r  role  of  the  event  must  be  n  7 
explained  :» foundpath  or  explained 

endif 

endfor 

r  Consider  abstractions  of  etype  7 
forall  atype  e  direct-abstractions(etype) 
if  atype  e  visited  then 

/*  eliminate  unnecessary  abstractions  V 
let  adescr  be  the  role/value  pairs  computed  for  n, 

limited  to  the  non-step  roles  defined  for  atype 
(an, foundpath, visited)  :> 

explain(atype,  adescr,  visited,  False,  primary) 
add  n  as  an  alternative  for  an 
explained  :«  foundpath  or  explained 
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endlf 

endif 

/*  Consider  specializations  of  etype  7 
if  consider-spec  then 

/*  dont  abstract  up  and  then  specialize  back  down  7 
foraii  stype  e  direct-specializations(etype) 

(-.foundpath,-)  :■ 

explain(stype,  edescr,  visited,  True,  primary) 
explained  :■ foundpath  or  explained 

endfor 

endif 

status(n)  :■  explained 
return  (n,  explained,  visited) 
end  explain 


Check  for  Redundant  Paths 

(called  by  Explain) 


/*  Tree  Version  7 

function  redundant(etype,  r,  utype,  visited,  primary)  is 

return  (etype,  r,  utype)  abstracts  or  specializes  a  use 
for  some  member  of  visited 

end  redundant 

/*  Multiple-Inheritance  Version  7 

function  redundant(etype,  r,  utype,  visited,  primary)  is 

return 

((etype,  r,  utype)  abstracts  a  use  for  a  type  which 
abstracts*  primary)  v 

((etype,  r,  utype)  specializes  a  use  for  a  type  which 
specializes*  primary)  v 

(etype  does  not  abstract*  or  specialize*  primary  a 

(etype,  r,  utype)  specializes  a  use  for  a  type  which 
abstracts  primary)  v 

(etype  does  not  abstract*  or  specialize*  primary  a 

(etype,  r,  utype)  abstracts  a  use  for  a  member  of 
visited  a 

(etype,  r,  utype)  does  not  specialize  a  use  for  type 
which  abstracts  primary) 

end  redundant 
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.3. 


Match-Graphs 


Match-Graphs 

global  mgraph  /*  stores  the  graph  created  by  match  7 
/*  match-graphs 

graph  1,  graph2 :  graphs  to  be  matched 
returns 

graph3 :  represents  equating  End  nodes  of  graphs  1  and  2 
and  propagating  equality 

succeeded :  True  if  match  succeeded,  false  otherwise 
7 

function  match-graphs(graph1f  graph2)  Is 
initialize  mgraph 

(-.succeeded)  :■  match(End-node-of(graph1),  End-node-of(graph2)) 
return  (mgraph,  succeeded ) 
end  match-graphs 

hash-table  previous-match  /*  cache  for  results  of  each  call  to  match  7 
/•match 

nl  ,n2:  values  to  be  matched  (either  nodes,  time  intervals,  or 
fuzzy  temporal  constraints) 

returns 

n3 :  value  which  represents  the  result  of  the  match 
succeeded :  if  true,  then  nl  and  n2  could  be  equal 
7 

function  match(n1 ,  n2)  is 

if  nl  and  n2  are  rigid  designators  then 
return  (nl,  n1*n2) 

elseif  nl  and  n2  are  fuzzy  temporal  constraints  then 

return  (  Fintersection(n1  ,n2),  -nempty(Fintersection(nl,n2))  ) 
else  P  nl  and  n2  are  nodes  7 

/•if  nl  and  n2  have  already  been  matched,  reuse  that  result  7 

n3  :»  previous-match(n1,n2) 

if  n3  #  0  then  return  (n3,  status(n3))  endif 

/*  check  that  types  of  nl  and  n2  are  compatible  7 

n3type  :■  greatest  lower  bound  of  (type(nl),  type(n2)} 

If  n3type  *  0  then  return  (-.False)  endif 
create  new  node  n3  of  n3type  in  mgraph 
r  add  all  known  role/values  of  nl  and  n2  to  n3,  matching 
identical  roles  7 
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forall  roles  r  defined  for  n3type 
vl  :■  roleval(nl.r) 
v2  :*  noleval(n2,r) 
if  either  vl  or  v2  is  defined  then 

if  vl  is  defined  but  not  v2  then 
(v3,  okay)  :■  match(vl  ,v1 ) 
eiseif  v2  is  defined  but  not  vl  then 
(v3,  okay) :»  match(v2,v2) 

eiseif 

(v3,  okay)  :■  match(v1  ,v2) 

endif 

if  -.okay  then  return  (n3,  False)  endif 
add  v3  as  the  r  role  of  n3 

endif 

endfor 

check  equality,  temporal,  and  fact  constraints  on  n3 
if  constraints  violated  then 
status(n3)  :*  False 
return  (n3,  False)  endif 
f  match  alternatives  for  each  node  7 
if  either  nl  or  n2  has  an  alternative  then 
for  al  e  alternatives-for(nl) 

(or  al-nl  if  no  alternatives) 
for  a2  e  alternatives*for(n2) 

(or  a2-n2  if  no  alternatives) 

(a3,  okay) :»  match  (al ,  a2) 
if  okay  then 

add  match(al  ,a2)  as  a  alternative 
for  n3 

endif 

endfor 

endfor 

endif 

if  there  were  alternatives  of  nl  or  n2,  but  none  matched  then 
return  (n3, False) 
else  return  (n3,True)  endif 

endif 

end  match. 


7.6.4.  Group  Observations 
7. 6. 4.1.  Non-Dichronic  Version 
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Group-Observations 
Non-Dichronic  Version 


global  Hypoths 

r  Aset  (disjunction)  of  hypotheses,  each  a  set  (conjunction)  of  ex¬ 
planation  graphs .  Each  hypothesis  corresponds  to  one  way  of 
grouping  the  observations.  Different  hypotheses  may  have  different 
cardinalities.  7 

function  minimum-Hypoths  Is 

smallest  :■  min  { card(hypo)  |  hypo  e  Hypoths} 

return  { hypo  |  hypo  e  Hypoths  a  card(hypo)*smallest } 
end  minimum-Hypoths 

procedure  group-observations  is 
Hypoths :» {0} 
while  more  observations 

obtain  observation  (etype,  edescr) 
obs-graph  :»  explain-observation(etype,  edescr) 
forall  hypo  a  Hypoths 

remove  hypo  from  Hypoths 
add  hypo  u  obs-graph  to  Hypoths 
forall  g  e  hypo 

(new-g,  okay) :»  match-graphs(obs-graph,  g) 

If  okay  then 

add  (hypo  -  {g})  u  new-g  to  Hypoths 

endlf 

endfor 

endfor 

endwhile 

end  group-observations 


7. 6. 4.2.  Incremental  Minimization  Version 


Group-Observations 
Incrementa  Minimization  Version 
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global  Hypoths 

r  A  set  (disjunction)  of  hypotheses,  each  a  set  (conjunction)  of  ex- 
planation  graphs.  Each  hypothesis  corresponds  to  one  way  of 
grouping  the  observations.  All  hypotheses  have  the  same  cardinali¬ 
ty,  corresponding  to  the  minimum  number  of  End  events  7 

function  minimum-Hypoths  Is 
return  Hypoths 
end  minimum-Hypoths 

procedure  group-observations  is 
Hypoths  :*  {0} 
while  more  observations 

obtain  observation  (etype,  edescr) 
obs-graph  :■  explain-observation(etype,  edescr) 
old-hypoths  :*  Hypoths 

forall  hypo  e  Hypoths 

remove  hypo  from  Hypoths 
forall  g  e  hypo 

(new-g,  okay) :»  match-graphs(obs-graph,  g) 

If  okay  then 

add  (hypo  -  {g})  u  new-g  to  Hypoths 

endif 

endfor 

endfor 

if  Hypoths«0  then 

forall  hypo  s  Hypoths 

add  obs-graph  to  hypo 

endfor 

endif 

endwhile 

end  group-observations 


7. 6. 4.3.  Sticky  Version 


Group-Observations 
Sticky  Version 


global  hypo 

/*  An  ordered  list  of  e-graphs,  from  most  recent  to  least  recent,  repre¬ 
senting  a  single  hypothesis  7 
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function  minimum-Hypoths  is 
return  {hypo} 
end  minimum-Hypoths 

procedure  group-observations  is 
hypo :»  NIL 

while  more  observations 

obtain  observation  (etype,  edescr) 
obs-graph  :»  explain-observation(etype,  edescr) 
block  update 

foreach  g  e  hypo  in  order 

(new-g,  okay) :»  match-graphs(obs-graph,  g) 
if  okay  then 

hypo  :*  cons{new-g,  remove(g,  hypo)) 
exit  block  update 

endif 

endfor 

hypo  :■  cons(obs-graph,  hypo) 
endblock  update 
endwhile 

end  group-observations 


7.7.  Description  of  Operation 

The  following  section  steps  through  and  explains  the  operation  of  the  algo¬ 
rithms.  The  transcripts  in  Appendix  E  contain  detailed  traces  of  Explain  and  Match 
on  many  of  the  specific  examples  discussed  in  this  thesis. 

7.7.1.  Explain 

•  Check  whether  the  graph  under  construction  already  contains  a  node  of  etype  which 
exactly  matches  edescr,  and  contains  no  additional  parameters.  If  this  is  the  case,  then 
the  graph  merges  at  this  point.  Consider  the  e-graph  at  the  beginning  of  this  chapter. 
Suppose  the  left-hand  side  of  the  graph  has  been  constructed  (nodes  Nl,  N2,  N4,  N6, 
and  N7).  Search  is  proceeding  along  the  right-hand  part  of  the  graph,  through  N3. 
The  invocation  of  explain  which  created  MakeMeatDish(N5)  is  considering  abstrac¬ 
tions  of  MakeMeatDish  (see  below),  and  recursively  calls  explain  with  type  Prepare - 
Meal.  The  specific  call  would  be: 
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explain(  PrepareMeal,  {...},  {MakeMeatDish},  False,  MakeMeatDish ) 

This  description  exactly  matches  previously-created  node  N6,  which  is  returned.  Then 
N5  is  made  an  alternative  for  N6.  Thus  the  left  path  through  N2  and  N4  merges  with 
the  right  path  through  N3  and  N5.  This  kind  of  merging  can  prevent  combinatorial 
growth  in  the  size  of  the  graph. 

•  Create  a  new  node  of  type  etype,  and  link  all  the  role/value  pairs  in  edescr  to  it 

•  Check  whether  the  type  of  the  newly-created  node  is  End.  If  so,  then  no  further  ex¬ 
planation  is  possible,  and  so  return. 

•  Propagate  and  check  constraints.  Suppose  this  is  the  invocation  of  explain  which 
created  MakeSpaghetdMarinara(N2).  Edescr  is  {(step2  Nl)},  meaning  that  compo¬ 
nent  step2  of  N2  is  Nl.  The  equality  constraints  say  that  the  agent  of  any 
MakeSpaghettiMarinara  must  equal  the  agent  of  its  MakeMarinara  step.  (This  con¬ 
straint  is  inherited  from  the  more  general  one  given  for  MakePastaDish.)  If  initially 
(Nl  agent  Joe)  appears  in  the  graph,  after  this  step  (N2  agent  Joe)  also  appears. 

Fuzzy  time  constraints  are  also  propagated.  N2  is  constrained  to  occur  over  an  interval 
which  contains  the  time  of  Nl.  Suppose  the  graph  initially  contains  (Nl  time  (4  5  6 
7)).  After  this  step,  it  also  contains  (N2  time  (-°°  5  6  -h»)). 

This  step  can  also  eliminate  nodes,  marking  them  as  failed.  The  agent  of  every  spe¬ 
cialization  of  MakePastaDish  is  constrained  to  be  dexterous.  Suppose  the  general 
world  knowledge  base  contains  the  assertion  — Dexterous  (Joe).  Then  this  step  fails  for 
this  invocation,  and  explain  sets  the  status  of  N2  to  false  and  returns. 

•  Consider  Uses  of  etype.  The  calculation  of  the  set  of  Uses  is  described  in  Section 
7.3.3.  If  etype  is  MakeMarinara,  then  explain  is  recursively  invoked  with  e types  of 
MakeSpaghettiMarinara  and  MakeChickenMarinara.  The  call  to  redundant  eliminates 
uses  which  have  already  been  considered  by  alternative  paths.  Examples  follow. 

•  Explain  etype  by  considering  its  abstractions.  The  node  becomes  an  altemative-for 
its  abstractions.  Suppose  the  current  invocation  has  created  MakePastaDish(N4).  This 
step  calls 

explain!  PrepareMeal,  {(agent  Joe)  (time  (-*»  5  6  +°°))}, 

{MakePastaDish},  False,  MakePastaDish ) 

which  returns  N6. 
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Not  all  abstractions  lead  to  End;  some  are  pruned,  and  don't  appear  in  the  final  graph. 
Consider  the  invocation  which  created  MakeMarinara(Nl).  It  calls  explain  for  Make- 
Sauce.  The  only  Use  for  MakeSauce,  however,  is  (MakeSauce,  step2,  MakePas- 
taDish);  but  that  use  is  eliminated  by  the  redundant  test.  Therefore  no  node  of  type 
MakeSauce  appears  in  the  final  graph. 

•  Try  to  explain  etype  by  considering  specializations  of  etype.  This  step  is  not  per¬ 
formed  if  etype  was  reached  by  abstracting  some  other  type,  rather  than  as  a  Use. 
This  always  occurs  in  the  example.  But  suppose  explain  were  initially  invoked  with 
etype  equal  to  MakeSauce.  Then  the  specialization  MakeMarinara  is  considered.  In 
the  recursive  invocation  of  explain,  the  Uses  of  MakeMarinara  are  examined.  The  use 
(MakeMarinara,  step2,  MakeSpaghettiMarinara)  is  eliminated  by  redundant,  because 
it  specializes  (MakeSauce,  step2,  MakePas  taDish).  The  use  (MakeMarinara,  stepS, 
MakeChickenMarinara),  however,  does  lead  to  a  path  to  End. 

•  If  any  path  to  End  was  found,  the  status  of  the  node  is  set  to  True,  and  otherwise  to 
False.  Finally  the  name  of  the  node  and  its  status  are  returned. 

7.7.2.  Match 

The  following  diagram  shows  two  e-graphs,  the  first  built  from  an  observation  of 
MakeMarinara,  and  the  second  from  an  observation  of  MakeNoodles.  Match  is  ini¬ 
tially  invoked  on  the  End  nodes  of  the  two  graphs: 

match(N7,  N1 1) 

and  returns  N12,  the  End  node  of  the  combined  graph.  The  following  section  steps 
through  the  operation  of  Match. 
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End(N7) 
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figure  7.4:  E-Graphs  for  MakeMarinara,  MakeN oodles,  and  their  Match 
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•  If  the  objects  to  be  matched  are  rigid  designators,  they  must  be  lexically  identical. 

•  If  the  objects  are  fuzzy  temporal  constraints,  then  take  their  intersection.  Suppose  nl 
is  (-°°  5  6  +«),  and  n2  is  (-*»  8  7  +«).  Then  match  returns  (-°°  5  7  +<*). 

•  Check  whether  nl  and  n2  have  already  been  matched,  and  if  so,  reuse  that  value. 
Suppose  that  die  first  e-graph  in  the  diagram  above  were  matched  against  an  e-graph  of 
identical  shape;  for  example,  there  were  two  observations  of  MakeMarinara,  and  they 
may  have  been  identical.  During  the  match  down  the  left  hand  side  of  the  graphs, 
through  N4  and  N2,  MakeMarinara (N 1 )  would  match  against  the  MakeMarinara  node 
in  the  second  graph  (say,  Nl’),  resulting  in  some  final  node,  say  Nl".  Then  the  right- 
hand  side  of  the  graphs  would  match,  through  N5  and  N3.  Nl  would  match  against 
Nl'  a  second  time,  and  the  value  Nl"  would  be  used  again.  This  would  retain  the 
shape  of  the  digraph,  and  prevent  it  from  being  multiplied  out  into  a  tree. 

•  Add  a  new  node  to  the  graph,  n3,  to  represent  the  result  of  the  match,  which  is  of 
the  greatest  lower  bound  type  of  nl  and  n2.  Matching  MakeSpaghettiMarinara(N2) 
against  MakePastaDish(N9)  results  in  MakeSpaghettiMarinara(Nl5).  The  match  fails  if 
the  types  are  not  compatible.  Thus  MakeMeatDish(N5)  fails  to  match  against 
MakePastaDish(N9) 

•  Match  the  roles  of  nl  and  n2.  If  a  role  value  is  defined  for  one  node  but  not  the 
other,  match  the  role  against  itself  in  order  to  simply  copy  the  structure  into  the  result¬ 
ing  graph.  This  what  happens  when  N2  matches  N9,  yielding  N15.  The  new  node 
gets  both  the  step2  role  value  from  N2  (a  copy  of  Nl,  which  is  N17)  and  the  stepl 
role  value  from  N9  (a  copy  of  N8,  which  is  N16).  If  N2  had  the  role  stepl  defined, 
that  value  would  have  had  to  match  against  N8. 

Suppose  both  End  nodes,  N7  and  Nil,  have  the  agent  roles  defined.  This  step 
checks  that  the  agents  are  the  same. 

•  Check  and  propagate  the  constraints  on  n3.  New  constraint  violations  may  be  de¬ 
tected  at  this  point  because  more  of  the  roles  of  the  nodes  are  filled  in.  Violations  of 
temporal  orderings  between  steps  are  detected  at  this  point  The  transcripts  contain  an 
example  where  an  e-graph  based  on  Boil  fails  to  match  against  one  based  on 
MakeNoodles,  because  the  Boil  occurred  before  the  MakeN oodles. 

•  Try  matching  every  alternative  fornl  against  every  alternative  for  n2.  The  success¬ 
ful  matches  are  alternatives  for  n3.  If  one  of  the  nodes  has  some  alternatives,  but  the 
other  does  not  then  match  the  alternatives  for  the  former  against  the  latter  directly. 
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This  occurs  in  the  example  above.  MakePastaDish(N4)  matches  against  MakePas- 
taDish(N9).  N4  has  the  alternative  N2,  but  N9  has  none.  Therefore  MakeSpaghetti- 
Marinara(N2)  matches  against  MakePastaDish(N9)  as  well. 

If  there  were  some  alternatives  but  all  matches  failed,  then  n3  fails  as  well. 

•  Return  n3  and  whether  or  not  the  match  succeeded. 

7.7.3.  Group 

The  various  versions  of  group-observations  are  considerably  simpler  than  the  pre¬ 
vious  algorithms. 

The  Non-Dichronic  Version  maintains  a  set  of  all  possible  groupings  of  the  obser¬ 
vations.  It  functions  as  follows: 

•  Observe  an  event  of  etype  satisfying  edescr. 

•  Generate  obs-graph  by  explain. 

•  Conjoin  obs-graph  with  each  hypothesis  consisting  of  a  set  of  e- 

graphs.  Thus  obs-graph  is  considered  to  be  unrelated  to  the 
previous  observations. 

•  Try  to  match  obs-graph  with  each  e-graph  in  each  hypothesis. 

Thus  obs-graph  is  considered  to  be  related  to  a  previous  ob¬ 
servation. 

•  The  current  (mc-entailed)  conclusion  corresponds  to  the  disjunction  of 

all  hypotheses  of  minimum  size. 

The  Incremental  Minimization  Version  works  as  follows: 

•  Observe  an  event  of  etype  satisfying  edescr. 

•  Generate  obs-graph  by  explain. 

•  Try  to  match  obs-graph  with  each  e-graph  in  each  hypothesis. 

Thus  obs-graph  is  considered  to  be  related  to  a  previous  ob¬ 
servation.  Each  successful  match  leads  to  a  new  hypothesis. 
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•  If  obs-graph  cannot  be  matched  with  previous  e-graphs,  then  con¬ 
join  obs-graph  with  each  hypothesis. 


•  The  current  (imc-entailed)  conclusion  corresponds  to  the  disjunction 

of  ail  hypotheses,  all  of  which  are  of  same  size. 

Finally,  the  Sticky  Version  works  as  follows: 

•  Observe  an  event  of  etype  satisfying  edescr. 

•  Generate  obs-graph  by  explain. 

•  Try  to  match  obs-graph  with  each  e-graph  in  the  (single)  hypothe¬ 

sis,  in  chronological  order,  from  most  to  least  recent  As  soon 
as  a  match  succeeds,  update  the  hypothesis,  removing  the  old 
matching  e-graph,  and  adding  the  result  of  the  match  as  the  most 
recent  e-graph. 


•  If  obs-graph  cannot  be  matched  with  any  previous  e-graphs,  then 

conjoin  obs-graph  with  the  hypothesis,  as  the  most  recent  e- 
graph. 

•  The  current  conclusion  corresponds  to  the  single  hypotheses,  a  con¬ 

junction  of  e-graphs. 


7.8.  Completeness  &  Correctness 

The  necessarily  limited  nature  of  the  constraint  checking  performed  by  the  algo¬ 
rithms,  together  with  the  open-ended  nature  of  the  plan  recognition  problem,  makes  it 
impossible  to  guarantee  that  the  algorithms  will  arrive  at  the  strongest  conclusions 
justified  by  the  theory  of  mc-entailmenL  The  algorithms  are,  however,  correct:  the  in¬ 
terpretation  of  their  outputs  holds  in  all  minimum-covering  models.  This  most  easily 
seen  by  noting  that  the  algorithms  implement  the  proof  rules  discussed  in  Chapters  3 
and  4. 


The  key  to  the  correctness  of  explain  is  the  fact  that  the  algorithm  searches 
from  the  input  event  type  to  every  possible  use  of  events  of  that  type,  as  specified  by 
Theorem  3.1 1.  (A  precise  statement  and  proof  of  this  fact,  which  demonstrates  that  the 
set  Uses  and  the  redundant  check  are  properly  formulated,  appears  in  Appendix  F.) 
Each  path  leads  to  a  different  disjunctive  conclusion,  which  may  be  collapsed  with 
others  via  the  altemative-for  arcs. 
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The  match  algorithm  simply  propagates  an  equality  assertion  between  terms. 
If  two  terms  nl  and  n2  are  equal,  then  the  value  of  a  function  applied  to  the  first  is  the 
same  as  the  value  of  that  function  applied  to  the  second.  If  nl  is  equal  to  one  of  a  set  of 
alternatives,  and  n2  is  equal  to  one  of  another  set  of  alternatives,  then  certainly  at  least 
one  alternative  from  the  first  set  is  equal  to  some  alternative  from  the  second.  The  Dis- 
jointedness  Assumptions  justify  concluding  that  nl  and  n2  are  not  equal  if  they  are  not 
of  compatible  types. 

7.9.  Complexity 

One  straightforward  measure  of  the  complexity  of  the  algorithms  is  the  size  of 
largest  data  structure  possibly  produced.  A  more  detailed  analysis  would  measure  time 
complexity  as  well;  but  since  the  algorithms  visit  each  node  a  fixed  number  of  times, 
we  can  taity  space  complexity  as  a  rough  measure  of  time  complexity  as  well.  (A  pos¬ 
sible  exception  is  the  test  to  handle  the  multi-path  problem  in  the  multiple-inheritance 
version  of  explain-observation.  The  cost  of  the  best  algorithm  to  perform  this  test  is 
not  obvious.) 

The  size  of  an  e-graph,  |G|,  is  the  number  of  nodes  in  G.  The  expression 
"Algorithm  X  is  of  0(f( a))"  in  the  -following  sections  means  that  the  largest  data 
structure  constructed  by  X  on  input  n  is  no  larger  than  c*f(n),  for  some  fixed  constant 
c.  The  actual  worst-case  complexity  of  X  may,  of  course,  be  smaller  than  f(n).  The 
expression  "Algorithm  X  is  no  better  than  0(f(n))"  means  that  there  is  no  g  such  that  X 
is  of  0(g(n))  and  g  is  better  than  f  by  more  than  a  constant  factor,  that  is,  it  is  not  the 
case  that 


V  k  >  0 . 3  m  ,Vn,B>m3  g(n)  <  k*f(n) 

7.9.1.  Explain 

Suppose  that  the  events  described  by  H  contain  no  parameters.  Then  explain- 
observation  is  of  0([He1),  since  the  wont  case  would  lead  to  an  exhaustive  search  of  ail 
of  H.  But  suppose  explain-observation  did  not  merge  search  paths  at  abstraction 
points.  Then  the  algorithm  would  be  no  better  than  0(2I^eI),  because  an  event  can 
have  several  different  components  of  the  same  type.  The  figure  below  is  a  case  where 
such  combinatorial  blowup  could  occur. 
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figure  75:  Combinatorialfy  Explosive  Hierarchy 

This  example  shows  the  importance  of  maintaining  disjunctions  within  each  e-graph, 
using  the  altemadve-for  relation,  rather  than  creating  separate  databases  for  each  alter¬ 
native. 

If  parameters  (such  as  agent,  time,  etc.)  do  appear,  the  search  space  will  be 
larger.  During  the  construction  of  the  e-graph,  however,  very  few  of  these  roles  will 
have  known  values.  Thus  the  first  step  in  explain  will  very  frequently  find  a  similar 
node  and  cut  off  search. 

7.9.2.  Match 

Calling  match-graphs(Gi,G2)  frequently  returns  a  graph  which  is  smaller  than 
either  Gi  or  G2.  Unfortunately,  sometimes  the  resulting  graph  can  be  larger.  This  oc¬ 
curs  when  two  nodes  are  matched  which  have  several  alternatives,  all  of  which  are 
mutually  compatible.  Therefore  the  worst  case  complexity  of  match-graphs(G  1  ,G2)  is 
OflGimG2l). 

There  cannot  be  more  than  IG1HG2I  +  !Gil  +  IG2I  nodes  in  the  result,  because 
every  pair  of  nodes  always  matches  to  the  same  node.  The  last  two  addends  enter  in 
because  a  node  may  also  match  against  itself.  Without  the  use  of  the  cache  of  previous 
results,  performance  could  be  much  worse.  The  resulting  graph  would  always  be  a 
tree,  and  the  algorithm  would  be  no  better  than  0(2l^il*l^2l). 


7.9.3. 


Group 


Some  of  the  most  intimidating  complexity  results  arise  from  the  group-observa¬ 
tions  algorithm.  The  non-dichronic  algorithm  finds  all  consistent  partitions  of  the  set  of 
observations.  In  the  worst  case,  all  partitions  are  possible  -  any  subset  of  the  observed 
events  could  be  part  of  the  same  End  event  Then  Hypoths  will  contain  B(n)  hypothe¬ 
ses,  where  n  is  the  number  of  observations.  There  is  no  simple  closed  form  for  B(n), 
but  it  is  easy  to  show  that 


so  that  group-observations  could  contain  at  least  2n  hypotheses. 

Together  with  the  worst-case  estimates  for  explain  and  match,  Hypoths  could 
total  more  than  0(2n|H£|n)  nodes.  Of  course,  if  all  the  observations  could  be  part  of 
the  same  End  event,  then  minimum-hypos  will  return  a  single  hypothesis  consisting  of 
a  single  e-graph  which  incorporates  all  the  observations. 

The  basic  dichronic  algorithm  does  much  better  in  this  case.  If  all  the  observed 
events  could  be  part  of  the  same  hypothesis,  then  Hypoths  will  be  of  sire  1.  This 
worst-case  size  of  this  hypothesis  is  0(|Hgjn).  This  may  not  seem  like  much  of  an  im¬ 
provement,  but  remember  that  the  |H£|n  is  a  very  unlikely  upper  bound  -  it  would  only 
hold  if  all  the  observations  were  not  telling  us  anything  definite!  -  while  the  2n  factor  in 
the  non-dichronic  algorithm  would  be  accurate  in  the  common  case  of  all  the  observa¬ 
tions  being  possibly  related. 

Once  several  End  events  are  required  to  account  for  all  the  observations,  how¬ 
ever,  the  dichronic  algorithm  could  fall  prey  to  the  same  sort  of  combinatorial  explo¬ 
sion.  Thus  a  truly  practical  system  may  need  to  resort  to  the  sticky  dichronic  algorithm. 
Because  it  develops  a  single  hypothesis,  the  sticky  algorithm  has  an  absolute  upper 
bound  of  0(|HE|n)  (and  will  usually  do  much  better).  Sometimes  the  sticky  algorithm 
will  fail  to  find  the  correct  -  or  any  -  interpretation  of  the  data.  An  endless  number  of 
more  or  less  general  and  mere  or  less  efficient  algorithms  lie  between  the  non-dichronic 
and  sticky  versions. 
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7.10.  Transcripts 

The  explain-observation  and  match-graphs  algorithms  have  been  imple¬ 
mented  in  Common  Lisp,  and  appear  practical  in  small  domains.  The  recognition 
problems  discussed  in  this  thesis  all  run  in  a  few  seconds.  Appendix  E  contains  tran¬ 
scripts  of  the  programs  in  operation. 
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Chapter  8 
Conclusions 

8 . 1  The  Three  Levels 

[Marr  82]  argues  that  research  in  A.L  should  analyze  each  problem  at  three  dis¬ 
tinct  levels.  The  computational  theory  states  the  goal  of  the  computation,  why  it  is  ap¬ 
propriate,  and  provides  an  abstract  mapping  from  the  input  information  to  the  output 
The  level  of  representation  and  algorithms  provides  a  particular  representation  for  the 
input  and  output,  and  an  algorithm  for  the  transformation.  The  hardware  implementa¬ 
tion  shows  how  the  algorithm  can  be  realized  physically.  Quite  often  the  computation¬ 
al  level  is  overlooked;  the  result,  as  Marr  demonstrates  with  many  examples  from  the 
study  of  vision,  are  algorithms  which  do  not  actually  solve  the  intended  problem,  and 
are  based  on  incorrect  assumptions.  Neglecting  the  algorithmic  theory  is  dangerous  in 
another  way:  one  can  develop  elegant  formal  theories  which  would  require  infinite  time 
and  resources  to  implement 

Our  framework  includes  model,  proof,  and  algorithmic  theories.  How  do  these 
relate  to  the  three  levels?  The  model  theory  provides  a  mapping  from  models  of  the  in¬ 
put  to  models  of  the  conclusions;  thus  it  is  a  pure  information  to  information  mapping, 
a  computational  level  theory  of  the  most  abstract  kind.  The  proof  theory  is  a  computa¬ 
tional  theory  of  a  more  concrete  kind.  It  provides  a  non-deterministic  procedure 
(namely,  the  generation  of  the  non-monotonic  assumptions,  followed  by  the  application 
of  deductive  rules)  that  transforms  a  first-order  representation  of  the  input  to  that  of  the 
output  Thus  the  proof  theory  stands  between  Mart's  computational  and  algorithmic 
levels.  Finally  our  algorithmic  theory  fills  the  algorithmic  level,  by  providing  a  deter¬ 
ministic  procedure  which  may  be  directly  programmed  on  a  computer.  We  have  noth¬ 
ing  to  say  about  the  hardware  level,  how  anything  like  our  theory  could  be  realized  in  a 
brain.  Practically  all  work  that  suggests  how  high-level  inference  could  be  performed 
by  neurons  is  fueled  mostly  by  (often  ingenious)  speculation;  but  see  [Feldman  81]  for 
some  proposals  in  this  direction. 

Having  completed  this  thesis,  there  is  no  doubt  in  my  mind  that  the  most  diffi¬ 
cult  task  was  to  define  the  computational  theory  at  the  most  abstract  level.  Once  the 
rather  amorphous  problem  of  "plan  recognition”  was  tied  down,  work  could  proceed  in 
earnest  As  the  literature  review  in  Chapter  1  revealed,  similar  proposals  and  heuristics 
for  plan  recognition  problems  have  been  hashed  over  for  years,  since  the  early  work  of 
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Schmidt  and  Cenesereth.  The  minimal  model  construct  we've  developed  suggests 
what  some  of  these  heuristics  are  heuristics  for.  The  lack  of  a  clear  problem  statement 
is  no  doubt  one  of  the  reasons  for  the  stagnation  in  work  in  story  understanding  in  the 
past  decade. 

8.2  Applicability 

Our  framework  makes  explicit  the  assumptions  of  Exhaustiveness,  Disjointed¬ 
ness,  and  Component/Use  Completeness  which  underlie  our  theory  of  plan  recogni¬ 
tion,  as  well  as  most  previous  work.  These  assumptions  simplify  the  problem,  and  re¬ 
strict  its  applicability.  Do  these  assumptions  limit  our  work  to  toy  micro-worlds?1 

Every  day  new  kinds  of  events  occur,  and  yet  they  do  not  baffle  us.  An  intelli¬ 
gent  agent  cannot  rely  only  on  a  recognition  system;  it  must  contain  a  learning  compo¬ 
nent  as  welL  Earlier  we  suggested  that  it  might  be  appropriate  to  invoke  the  learning 
module  when  recognition  failed.  The  ability  of  our  framework  to  handle  levels  of  ab¬ 
straction  provides  a  crucial  feature  needed  for  learning.  For  example,  we  might  be  able 
to  conclude  that  the  agent  is  cooking  some  kind  of  pasta  dish,  even  if  we  can  rule  out 
any  of  the  particular  pasta  dishes  in  our  library.  It  may  be  possible  to  modify  the 
minimal  model  construction  so  that  the  Exhaustiveness  assumption  is  weakened,  and 
thus  allow  the  recognition  and  learning  modules  to  be  integrated. 

None  the  less,  much  of  what  a  plan  recognition  system  must  handle  is  routine. 
A  more  serious  problem  is  that  of  recognizing  erroneous  plans.  Our  framework  as¬ 
sumes  all  plans  are  internally  consistent,  and  that  all  acts  are  purposeful.  Yet  real  peo¬ 
ple  make  frequently  make  planning  errors  and  change  their  minds  in  midcourse. 

One  class  of  errors  can  be  handled  without  significant  change  to  our  frame¬ 
work.  Suppose  every  basic  event  type  is  given  two  new  specializations,  one  of  which 
also  specializes  a  new  type  called  Error.  The  diagram  below  illustrates  how  a  hierarchy 
would  be  transformed. 


1  [Dreyfus  85]  argues  that  this  criticism  applies  to  work  in  A.l.  in  general. 
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Then  each  observed  event  could  either  be  an  End  event  on  its  own  by  being  an  Error,  or 
it  could  be  part  of  an  End  event  in  a  correct  plan  from  the  original  hierarchy.  Minimiz¬ 
ing  the  number  of  End  events  prefers  interpretations  in  which  Errors  do  not  occur;  but 
observations  sequences  which  must  be  part  of  erroneous  plans  can  still  be  recognized. 


8.3  Generality  &  Extensibility 

Despite  die  limitations  outlined  above,  the  generality  of  our  framework  must  be 
stressed.  We  do  not  assume  that  there  is  a  single  plan  underway,  which  can  be 
uniquely  identified  from  the  first  input  We  do  not  assume  that  the  sequence  of  obser¬ 
vations  is  complete.2  Finally,  we  do  not  assume  that  all  the  steps  in  a  plan  are  linearly 
ordered.  Indeed,  we  know  of  no  other  implemented  plan  recognition  system  which 
handles  arbitrary  temporal  relations  between  steps. 

The  logical  basis  for  die  system  makes  it  dear  what  conclusions  should  be 
reached  on  the  basis  of  quantified  and  disjunctive  information.  Section  2.7  showed 


^Completeness  of  the  observations  is  a  nice  property  to  exploit  when  it  is  available.  [Segre  86]  has 
built  a  plan  recognition  system  for  use  in  programming  a  robot  arm.  The  operator  guides  the  arm 
manually,  and  the  system  recognizes  what  plans  are  being  performed.  No  kind  of  non-deductive 
inference  is  needed:  because  every  step  of  the  plan  is  input,  the  fact  that  the  plan  occurs  logically 
follows. 
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how  plans  involving  conditional  actions  could  be  represented  by  event  hierarchies. 
Previous  plan  recognition  systems  did  not  handle  any  of  these  situations. 

The  generality  of  the  system  suggests  that  it  may  be  extended  in  various  ways  in 
a  straightforward  manner-  An  important  one  for  plan  recognition  is  the  ability  to  deal 
with  plans  containing  iteration  and  recursion.  For  example,  a  plan  to  pick  up  all  the 
blue  blocks  might  be  encoded  as  follows: 

Vx  .  pickup-blues(x)  d  Vy.  block(y)  a  blue(y)  d 

pickup(s(y,x))  a  object(s(y,x))=y 

That  is,  if  an  instance  of  picking  up  blue  blocks  occurs,  then  for  every  block  that  is 
blue,  an  instance  of  pickup  occurs  with  that  block  as  its  object  We  have  not  yet  inves¬ 
tigated  whether  this  extension  to  the  form  of  the  event  hierarchy  requires  modification 
in  the  model  theory.  Another  area  for  future  work  is  to  extend  our  notion  of  an  event 
hierarchy  to  include  hierarchies  with  exceptions  [Touretzky  84]. 

Finally  we  need  to  consider  principles  other  than  Occam's  Razor  for  determin-  . 
ing  the  class  of  preferred  interpretations.  Rather  than  ordering  models  during  the 
minimization  process  by  the  number  of  End  events,  one  may  wish  to  include  other  fac¬ 
tors,  such  as  qualitative  likelihoods  of  various  event  types.  It  remains  to  be  seen  how 
far  the  general  framework  can  be  pushed,  or  if  other,  more  quantitative  measures  be¬ 
long  in  a  separate  theory,  which  applies  to  the  conclusions  of  our  system. 

8.4  Two  Unresolved  Issues 

After  reviewing  the  strong  and  weak  points  of  our  work,  a  couple  of  unresolved 
issues  remain.  The  first  is  the  question  of  scale-up.  As  discussed  in  Chapter  7,  the 
worst-case  behavior  of  our  recognition  algorithms  can  be  bad,  although  proper  struc¬ 
turing  of  the  hierarchy  greatly  reduces  search.  How  much  of  a  problem  will  this  be  in 
more  realistically-sized  knowledge  bases?  Experience  in  creating  and  examining  the 
structure  of  larger  domains  is  necessary. 

A  more  philosophical  problem  is  the  whole  issue  of  what  serves  as  primitive 
input  to  the  recognition  system.  Throughout  this  thesis  we've  assumed  that  arbitrary 
high-level  descriptions  of  events  are  simply  presented  to  the  recognizer.  This  assump¬ 
tion  is  reasonable  in  many  domains,  such  as  understanding  written  stories,  or  observ¬ 
ing  the  words  typed  by  a  computer  operator  at  a  terminal.  But  a  real  plan  recognizer  -  a 
person  -  doesn’t  always  get  his  or  her  input  in  this  way.  How  are  visual  impressions 
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of  simple  bodily  motions  -  John  is  moving  his  hands  in  such  and  such  a  manner  - 
translate  into  the  impression  that  John  is  rolling  out  dough  to  make  pasta?  There  is  a 
great  deal  of  work  in  low-level  perception,  and  a  great  deal  in  high  level  recognition  - 
including  this  thesis.  The  semantic  gap  between  the  output  of  the  low-level  processes 
and  the  high-level  inference  engines  remains  wide,  and  few  have  ventured  to  cross  it 
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Appendix  A 
Chapter  3  Proofs 

Theorem  3.1  (Exhaustiveness) 

Suppose  {Ei,  E2, ... ,  EjJ  are  all  the  predicates  directly  abstracted  by  Eo  in  H^.  Then 
the  statement: 

Vx  .  Eo(x)  zd  (Ei(x)  v  E2(x)  v  ...  v  En(x)) 
is  true  in  all  models  of  Ha  which  are  closed  under  specialization.  The  statement  is  also 
true  in  all  A-closed  models  of  H. 

Proof 

Let  Mi  be  a  model  of  Ha  in  which  the  statement  does  not  hold.  We  prove  that  Mi 
cannot  be  closed  under  specialization. 

For  the  statement  to  be  false  in  M^  there  must  be  some  :C  such  that 
Eq(x)  A  -'Ei(x)  a  ...  a  -iEn(x) 
is  true  in  Mi{x/:C}.  Define  M2  by 
Domain  (M2)  =»  Domain  (M 1) 

M2[Z1 »  Mi[Z]  for  Z  *  Eg 
M2[Eo]-Mi(Eo]-{:C} 

That  is,  M2  is  the  same  as  Mi,  except  that  :C  «s  M2[Eq]. 

We  claim  that  M2  is  a  model  of  Ha-  Every  axiom  that  does  not  contain  Eo  is  true  in 
Mi,  and  therefore  receives  the  same  valuation  in  M2.  So  consider  axioms  that  contain 
Eq.  Axioms  of  the  form: 

Vx .  Eq(x)  zd  Ej(x)  i  *  0 

are  false  only  if  there  is  a  :D  such  that 
:D  €  M2[EqJ  a  :D  «  M2[Ei] 

But  we  know  that 
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:D  €  M2[Hq]  ^ 

:D  €  MiCEq]  =* 

:D  6  Mi[E[]  => 

:D  c  MjCEj] 

which  is  a  contradiction.  Therefore  there  can  be  no  such  :D,  and  the  axiom  holds  in 
M* 

The  only  other  axioms  containing  Eq  are  of  the  form 
Vx  .  Ej(x)  3  Eq(x)  1  £  i  £  n 

Again,  this  fails  in  M2  if  there  is  a  :D  such  that 
:D  €  M2[Ej]  a  :D  €  M2[Eq] 

If  :D  *  :C,  then 

:D  €  M2[Ei]  ■=> 

:D  €  Mi[Ej]  => 

:D  €  Mj[Eq]  => 

:De  M2[E()] 

which  is  a  contradiction.  Finally,  if  :D  =  :C,  then,  since  :C  e  Mj[Ei], 

35  €  MitEJ  ao 
05  €  M2[Ej] 

which  also  is  a  contradiction.  Therefore  there  can  be  no  such  :D.  So  M2  is  a  model  of 

ha- 

Furthermore,  Mi  and  M2  agree  on  all  terms  and  predicates  outside  of  Hg-Hgg.  (Eq 
cannot  be  a  basic  event  type.)  M2  defeats  Mi's  candidacy  for  minimality  in  Hg-Hgg. 
Therefore  the  statement  in  Theorem  3.1  must  hold  in  all  models  closed  under 
specialization.  Since  A-closed  models  are  a  subset  of  those  closed  under  specialization, 
the  statement  also  holds  in  all  A -dosed  models. 

Q.E.D. 


Theorem  3.2 
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Let  EXA  be  the  set  of  all  statements  which  instantiate  Theorem  3. 1  for  a  particular  H. 
If  Mj  is  a  model  of  H^  u  EXA  such  that  :C  s  Mi  [Eg],  then  there  is  a  basic  event 
type  Efe  €  Hf~r  such  that  :C  €  and  Eg  abstracts*  Efc. 


Proof 

By  induction.  Define  a  partial  ordering  over  He  by  Ej  <  E^  iff  Efc  abstracts  Ej. 
Suppose  E()  €  Heb*  Then  Eo  abstracts*  Eg.  Otherwise,  suppose  the  lemma  holds 
for  all  Ej  <  Eg.  Since 

Vx  .  Eg(x)  =>  (Ei(x)  v  E2(x)  v  ...  v  E^x)) 
and  :C  €  Mi(Eq],  it  must  the  case  that 

:C  €  MifEj]  v  ...  v  :C  6  Mi[En] 

WLG,  suppose  :C  6  Mi[Ei].  Then  there  is  an  Ej,  such  that  Ei  abstracts*  E^  and 
:C  €  M1OE5].  Since  Eg  abstracts  Ei,  it  also  abstracts*  E^. 


Q.E.D. 


Theorem  3.3 

Every  event  in  a  model  closed  under  specialization  is  of  at  least  one  basic  type.  That  is, 
if  Mi  is  a  model  of  closed  under  specialization  such  that  :C  e  Mi  [Eg],  then  there 
is  a  basic  event  type  E},  e  Heb  suc^  ^at  e  MitEfc]  and  Eg  abstracts*  E^. 

Proof 

By  Theorem  3,1,  H^, «. 1  EXA  holds  in  M^  and  by  Theorem  3.2  there  is  such  an  E^. 

Q.E.D. 
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Theorem  3.4 


is  a  model  of  HA  closed  under  specialization  if  and  only  if  Mx  is  a  model  of 
Ha  u  EXA. 

Proof 

The  "only  if’  half  follows  from  Theorem  3.1.  We  prove  that  if  is  a  model  of 
Ha  u  EXA,  then  M  is  closed  under  specialization. 

Suppose  not  Then  there  is  an  M2  which  defeats  M}.  M2  and  Mi  agree  on  Hgg,  but 
there  is  (at  least  one)  Eq  €  Hr-Hf.b  and  event  :C  such  that 
:C  e  Mi[Eq]  a  :C  «  M2[Eol 
By  Theorem  3.2,  there  is  an  Eb  €  Hgg  such  that 
:C€Mx[Eb] 

Since  Mi  and  M2  agree  on  Hfb, 

:C  €  M2[Eb] 

But  then  because  Eq  abstracts  Eb 
:C  €  M2[Eq] 

which  is  a  contradiction.  Therefore  there  can  be  no  such  M2.  Thus  Mj  is  closed  under 
specialization. 

Q.E.D. 

Theorem  3.5  (Disjointedness) 

If  event  predicates  Ex  and  E2  are  not  compatible,  then  the  statement: 

Vx  .  -iEx(x)  v  -iE2(x) 

is  true  in  all  models  of  HA  which  are  closed  under  abstraction.  The  statement  is  also 
true  in  all  A-closed  models  of  H. 
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Proof 


Suppose  Mi  is  a  model  of  Ha  closed  under  specialization  in  which  the  statement  is 
false.  We  prove  that  M}  is  not  closed  under  abstraction. 

So  let  :C  be  an  event  such  that 
Ei(x)  a  E2(x) 

is  true  in  Ml{x/:C},  where  Ej  and  E2  are  not  compatible.  Using  Theorem  3.3,  let  E5 
be  a  basic  event  type  abstracted  by  E}  such  that  :C  €  MifE^].  Define  M2  as  follows. 
Domain(M2)  »  Domain(Mi) 

M2[Z]«M1[Z3forZ*  HE 
MifEil*  MjCEj]  if  E{  abstracts*  Ei, 

Mi(Ej]-{:C}  otherwise 

In  particular,  note  that  Mi[AnyEvent]=>M2[  Any  Event],  since  AnyEvent  certainly 
abstracts  Ejj.  We  claim  that  M2  is  a  model  of  Ha  closed  under  specialization. 

(Proof  that  M2  is  a  model  ofH &) 

Suppose  M2  is  not  a  model  of  H^;  in  particular,  suppose  the  axiom 
Vx .  Ej(x)  3  Ej(x) 

is  false  in  M2.  Since  it  is  true  in  Mj,  and  M2  differs  from  Mj  only  in  the  absence  of  :C 

from  the  extension  of  some  event  predicates,  it  must  be  the  case  that 
:C  €  M2[Ej]  a  :C  «£  M2[Ej] 

while 

:C  6  Mj[Ej]  a  :C  e  Mj[Ej] 

By  the  definition  of  M2,  it  must  be  the  case  that  Ej  abstracts*  E^.  Since  Ej  abstracts  Ej, 
then  Ej  abstracts*  Efe  as  welL  But  then  M}  and  M2  would  have  to  agree  on  Ej;  that  is, 
:C  €  M2[EiJ 

which  is  a  contradiction.  So  M2  must  be  a  model  of  Ha  after  all. 

(Proof  that  M2  is  closed  under  specialization) 

By  Theorem  3.4,  M2  is  closed  under  specialization  if  it  is  a  model  of  EXA.  So 
suppose  it  is  not  a  model  of  EXA;  in  particular,  suppose 


Vx  .  Ejo(x)  =>  (Eji(x)  v  Ej2(x)  v  ...  v  Ejn(x)) 
is  false.  Then  it  must  be  the  case  that 

:C  €  M2[Ej’oJ  a  :C  «  M2fEji3  a  ...  a  :C  «  M2fEjn] 

But  :C  6  M2[Ejo3  means  that  Ejo  abstracts*  E5.  Since  Ejo  is  not  basic,  at  least  one  of 

Eji . Ejn  abstracts  E^.  WLG,  suppose  it  is  Ejj.  Then 

:C  €  M2PE5]  =» 

:C  e  M2[Ej!3 

which  is  a  contradiction.  Therefore  all  members  of  EXA  hold  in  M2,  so  M2  is  closed 
under  specialization. 

(Conclusion  of  proof) 

Furthermore,  M 1  and  M2  agree  on  all  terms  and  predicates  outside  of 
Hjr-{AnyEvent}.  Because  the  extension  of  E2  in  M2  is  a  proper  subset  of  the 
extension  of  E2  in  Mj,  M2  defeats  Mi's  candidacy  for  minimality  in  Hg-{AnyEvent} 
among  models  closed  under  specialization.  Thus  the  statement  in  Theorem  3.5  holds  in 
all  models  closed  under  abstraction,  and  the  subset  of  A-closed  models. 


Q.E.D. 


Theorem  3.6 

Let  DJA  be  die  set  of  all  statements  which  instantiate  Theorem  3.5  for  a  particular  H.  If 
Mi  is  a  model  of  u  EXA  u  DJA  such  that  :C  6  Mi  [Eg],  then  there  is  a  unique 
basic  event  type  E5  such  that  :C  e  Mi[Ejj].  Any  event  type  which  holds  of  :C 
abstracts*  Ej,. 

Proof 

By  Theorem  3.2  there  must  be  at  least  one  Ej,  e  Hgg  such  that  :C  s  Mi[E{j].  By 
definition  any  two  distinct  basic  event  types  are  not  compatible;  thus  DJA  contains  an 
axiom  of  the  form 
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Vx  .  -lEb(x)  V  -nEj,' 

for  all  other  basic  event  types  Ej,1.  Therefore  E5  is  unique.  By  Theorem  3.2  any  event 
type  which  holds  of  :C  must  abstract*  E5. 

Q.E.D. 


Theorem  3.7  (Unique  Basic  Types) 

If  Mi  is  a  model  of  closed  under  abstraction  containing  event  token  :Cj,  then  there 
is  a  unique  basic  event  type  Ej,  such  that  :C  6  Mj[Ei,].  Any  event  type  which  holds  of 
:C  abstracts*  E^. 

Proof 

By  Theorems  3.1  and  3.5,  Ml  is  a  model  of  u  EXA  u  DJA.  By  Theorem  3.6 
there  is  such  an  E^. 


Q.E.D. 


Theorem  3.8 

If  Mi  is  an  model  of  u  EXA  u  DJA,  then  Mi  is  a  model  of  closed  under 
abstraction. 

Proof 

Suppose  not;  then  there  is  an  M2  closed  under  specialization  which  defeats  Mi  for 
minimality  in  Hg-{  Any  Event}.  Mi  and  M2  agree  on  AnyEvent,  but  there  exists  (at 
least  one)  Eq  e  Hg-f  AnyEvent}  and  event  :C  such  that 
:C  e  Mi[E0]  a  :C  «  M2[EoJ 
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Now 


:C  6  Mi[Eq]  =* 

:C  €  Mi  [Any  Event]  => 

:C  e  M2[AnyEvent] 

By  Theorem  3.3  there  is  some  Eb  e  Hgg  such  that 
:C  €  M2[Ejj] 

Since  M2  defeats  Mi, 

:C  €  MiCEjj] 

By  Theorem  3.6,  Eb  is  the  unique  basic  type  of  :C  in  Mj,  and  Eq  abstracts*  Eb.  But 
Eq  abstracts*  Eb  means  that 
:C  6  M2[Eb]  =* 

:C  €  M2[Eq] 

which  is  a  contradiction.  Therefore  there  can  be  no  such  M2,  and  Mi  must  be  closed 
under  abstraction. 


Q.E.D. 


Theorem  3.9  (Abstraction  Completeness) 

Mi  is  an  A-closed  model  of  H  if  and  only  if  Mi  is  a  model  of  H  u  EXA  u  DJA. 

Proof 

If  Mi  is  an  A-closed  model  of  H,  then  it  is  also  a  model  of  closed  under 
abstraction,  so  by  Theorems  3.1  and  3 3  it  is  a  model  of  H  u  EXA  u  DJA.  If  Mi  is 
a  model  of  H  vj  EXA  u  DJA,  then  it  is  also  a  model  of  u  EXA  u  DJA,  so  by 
Theorem  3.8  it  is  a  model  of  closed  under  abstraction.  Since  it  is  also  a  model  of 
H,  it  is  an  A-closed  model  of  H. 

Q.E.D. 
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Theorem  3.10  (No  Useless  Events) 

Let  Mi  be  a  covering  model  of  H,  containing  event  token  :Ci-  Then  either 
:Ci  €  Mi[End]  is  true,  or  there  exists  some  event  token  :C2  such  that  :Ci  is  a  direct 
component  of  :C2- 

Proof 

Suppose  the  lemma  is  false:  M  is  a  covering  model, 

:Ci  6  Mi[Ei] 

:Ci  «  Mi  [End] 

and  there  does  noi  exist  event  token  which  has  :Ci  as  a  direct  component  Define 
as  follows. 

M2[Z]  »  MifZ]  for  Z  €  Hg 
M2[E]  -  Mi[EH:Cl}  ’for  E  e  Hg 

Note  that  Mi  and  M2  agree  on  End.  If  M2  is  an  A-closed  model  of  H^,  then  M2 
defeats  Mi  for  minimality  in  Hg-{End}.  We  will  prove  this  by  showing  that 
H  u  EXA  u  DJA  holds  in  M2.  We  consider  each  of  the  types  of  axioms  in  turn. 

(Case  1)  Axioms  in  H<j  must  hold,  because  they  receive  the  same  valuation  in  Mi  and 

m2. 

(Case  2)  Axioms  in  are  of  the  form: 

Vx  .  Ej(x)  D  Ei(x) 

Suppose  one  is  false;  then  for  some  :D, 

:D  €  M2[Ej]  a  :D  €  M2[Ej] 

But  this  is  impossible,  because  Mi  and  M2  must  agree  when  :D  *  :Ci,  as  must  be 
case,  because  :Ci  does  not  appear  in  the  extension  of  any  event  type  in  M2. 

(Case  3)  Axioms  in  EXA  must  hold  by  the  same  argument 

(Case  4 )  Axioms  in  DJA  must  hold  because  they  contain  no  positive  uses  of  Hg. 

(Case  5)  The  j-th  axiom  in  Hp  is  of  the  form: 
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Vx  .  Ejo(x)  o  Eji(fji(x))  a  Ej2(fj2(x))  a  ...  a  Ejn(fjn(x))  a  ic 
Suppose  it  does  not  hold  in  M2.  Then  there  must  be  some  :C2  such  that 
EjoOO  a  {  -iEji(fji(x))  v 
-iEj2(fj2U))  v  ...  v 
^EjnCfjnfr))  v 
-.K} 

is  true  in  M2(x/:C2}.  Mi  and  M2  agree  on  k,  so  it  must  the  case  that  for  some  ji, 
M2[fji](:C2)  «  M2CEji] 

while 

MitfjiK^eMxDEji] 

Because  Mi  and  M2  differ  on  Eji  only  at  :Ci,  it  must  be  the  case  that 

Mi[fji](C2)-:Ci 

But  then  XT  1  is  a  component  of  :C2  in  Mi,  contrary  to  our  original  assumption. 

(End  of  Cases) 

By  Theorem  3.9  M2  is  A-closed,  and  so  it  defeats  Mi.  This  contradiction  proves  the 
theorem. 

Q.E.D. 


Theorem  3.11  (Component/Use) 

Let  E  6  He,  and  Com(E)  be  the  set  of  event  predicates  with  which  E  is  compatible. 

Consider  all  die  decomposition  axioms  in  which  any  element  of  Com(E)  appears  on  the 
right-hand  side.  The  j-th  such  decomposition  axiom  has  the  following  form,  where  Eji 

is  the  element  of  Com(E): 

Vx  .  Ejo(x)  o  Eji(fji(x))  a  ...  a  Eji(fji(x))  a  ...  a  Ejn(fjn(x))  a  k 
Suppose  that  the  series  of  these  axioms,  where  an  axiom  is  repeated  as  many  times  as 
there  are  members  of  Com(E)  in  its  right-hand  side,  is  of  length  m  >  0.  Then  the 
following  statement  is  c-valid: 
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Vx  .  E(x)  3  End(x)  v 

(3y  .  Elfo(y)  a  fii(y)-x)  v 
(3y  .  E2,o(y)  a  f2i(y)*x)  v 
...  v 

(3y  .  Ej^oCy)  a  fnu(y)*x) 


Proof 

Let  M  be  a  covering  model  such  that  :C  e  M[E]  and  :C  <s  M[End].  By  Theorem  3.10 
there  is  an  Ejo,  Ejj,  and  T)  such  that 
:D  e  M[Ejo] 

:C  e  M[EjiJ 
.*C  »  M[fji](:D) 

where  fjj  is  a  role  function  in  a  decomposition  axiom  for  Ejq.  By  Theorem  3.5,  E  and 
Ejj  are  compatible.  By  inspection  we  see  that  the  second  half  of  the  formula  above  is 
true  in  M  when  x  is  bound  to  :C,  because  the  disjunct  containing  Eji  is  true  when  the 
variable  y  is  bound  to  :D.  Since  the  choice  of  :C  was  arbitrary,  the  entire  formula  is 
true  in  M.  Finally,  since  M  could  be  any  covering  model,  the  formula  is  c-vaiid. 

Q.E.D. 


Theorem  3.12 

Let  CUA  be  the  set  of  all  formulas  which  instantiate  Theorem  3.1 1  for  a  particular  H. 
If  Mi  is  a  model  of  H  u  EXA  u  DJA  u  CUA  such  that  :C  6  Mi[E],  then  there  is 
a  :Cn  such  that  :Cn  e  Mi  [End]  and  :G  is  a  component*  of  :Cn. 

Proof 

If  :Ci  6  Ml  [End],  then  :Ci  is  a  component*  of  :Cn  =  :Ci.  So  suppose 
:Ci  €  Mi  [End].  For  the  axioms  in  CUA  to  hold  there  must  be  sequences  of  event 
tokens,  types,  and  role  functions  of  the  following  form: 
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:Ci  :Ci  :C3  :C3 

=C5 

=C5 

Ei  E2  E3  E4 

e5 

E6 

f3i 

f5i 

such  that 

For  all  j,  :Cj  €  Mi[Ej] 

For  odd  j,  Ej  and  Ej+i  are  compatible 

For  odd  j,  j  2  3,  Ej_i  is  a  direct  component  of  Ej,  and  :Cj.2  *  Mj  [fjj](:Cj) 

This  sequence  must  terminate  with  a  :CQ  such  that  :Cn  €  Mi  [End].  Otherwise,  some 
Ej  must  appear  more  than  once  in  the  sequence,  which  would  mean  that  a  cycle  exists  in 
H;  but  we  have  supposed  that  H  is  acyclic.  Therefore  :C  i  is  a  component*  of  Cn. 

Q.E.D. 


Theorem  3.13  (No  Infinite  Chains) 

If  M'i  is  a  covering  model  of  H  such  that  Cj  6  Mj[E],  then  there  is  a  :Cn  such  that 
:Cn  e  Mj  [End]  and  :Ci  is  a  component*  of  :Cn. 

Proof 

By  Theorems  3.9  and  3.8  Mi  is  a  model  of  H  u  EXA  vj  DJA  u  CUA;  by  Theorem 
3.12  there  is  such  a  :Cn. 

Q.E.D. 


Theorem  3.14  (Decomposition  Completeness) 

Mi  is  a  covering  model  of  H  if  and  only  if  Mi  is  a  model  of 
H  u  EXA  u  DJA  u  CUA. 
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Proof 


The  "only  if*  half  follows  from  Theorems  3.9  and  3.11.  We  prove  that  if  Mi  is  a 
model  of  H  vj  EXA  u  DJA  u  CUA,  then  it  is  a  covering  model. 

Suppose  not  By  Theorem  3.9,  is  A-closed,  so  there  must  be  an  A-closed  M2 
which  defeats  for  minimality  in  H£-{End}.  Mi  and  M2  agree  on  all  functions  and 
predicates  outside  of  Hg— [End},  but  there  exists  (at  least  one)  Ei  e  Hg-{End}  and 
event  :Ci  such  that 

:Ci  €  Mi[Ei] 

:Ci  €  M2DE1] 

By  Theorem  3. 12  there  is  a  :Cn  such  that 
:Cn  6  Mi  [End] 

and  :Ci  is  a  component*  of  :Cn.  By  the  construction  used  in  the  proof  of  Theorem 

3.12  there  are  sequences 

•Ci  :Ci  '.C3  :C3  ...  iCj  :Cn 

Ei  E2  E3  E4  ...  En  En+i * End 

*3i  •••  fni 

such  that 

For  all  j,  1  £  j  £  n+1,  :Cj  e  Mi[Ej] 

For  odd  j,  Ej  and  Ej+i  are  compatible 
For  odd  j,  3  £  j  £  n 

Ej.  1  is  a  direct  component  of  Ej 
•Cj-2  -  MiffjiJCCj)  =  M2[fjiK:Cj) 


Because  Mi  and  M2  agree  on  End, 

:Cn  6  M2[End] 

Now  for  any  odd  j,  3  £  j  £  n,  if 
•Cj  6  M2[Ej] 

then  since  Hq  holds  in  M2, 

•■Cj-2  e  M2tEj-i] 

If  we  prove  that  for  all  odd  j,  1  £  j  £  n 

:Cj  e  M2[Ej+i]  s*  :Cj  s  M2[Ej] 
we  will  be  done;  because  we  would  then  know  that 
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:Cn  €  M2[End]  ^ 

:Cjj  €  M2[En+i]  ^ 

:Cn  e  M2[En]  =* 

:Cn.2  €  M2[En.1]  => 

•.C3  6  M2[E3]  => 

:Ci  €  M2[E2l  =* 

:Ci  6  M2[Ei]  => 

which  yields  the  desired  contradiction.  So  lets  prove  that 
:Cj  €  M2[Ej+i]  =»  :Cj  e  M2[Ej] 

Assume  the  antecedent 
:Cj  €  M2[Ej+i] 

Because  M2  is  A-closed,  there  is  a  unique  Efc  e  Hgg  such  that 
:Cj  €  M2[Eb3 
Because  M2  defeats  Mj, 

:Cj  €  Mi[E|,3 

Since  Mj  is  A-closed,  Theorem  3.7  tells  us  that  because 
:Cj  €  Mj[Ej] 

it  must  be  the  case  that  Ej  abstracts*  Efc.  But  then  since  holds  in  M2, 
:Cj  e  M2[Ej] 
and  we  are  done. 


Q.E.D. 


Theorem  3.15  (Computability  of  C-Entailment) 

There  is  a  computable  function  cl  which  maps  a  hierarchy  H  into  a  set  of  axioms  with 
the  property  that 

Qh^cP 

if  and  only  if 

cl(H)  uQ  bP 
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Proof 


The  function  simply  computes  H  u  EXA  u  DJA  u  CUA,  which  is  a  finite, 
recursively  enumerable  set  Theorem  3. 14  guarantees  the  equivalence. 


Theorem  3.16  (Not  Predicate  Completion) 

Theorem  3.1 1  cannot  be  strengthened  by  considering  only  axioms  in  which  E  appears 
as  a  component,  instead  of  all  axioms  in  which  event  types  compatible  with  E  appear  as 
components. 

Proof 

Consider  the  following  hierarchy: 

ha 

Vx .  Ej(x)  3  AnyEvent(x) 

Vx .  E2(x)  3  AnyEvent(x) 

Vx  .  E3(x)  3  Ei(x) 

Vx  .  E3(x)  3  E2(x) 

Vx .  End(x)  3  AnyEvent(x) 

Vx .  E4(x)  3  End(x) 

Vx  .  E5(x)  3  End(x) 

hd 

Vx  .  E4(x)  3  Ej(f4(x))  a  P 
Vx  .  E5(x)  3  E2<f5(x))  a  Q 

The  modified  statement  would  not  hold  in  the  covering  model 
{  AnyEvent(D),  End(D),  E4(D),  Ei(f4(D)), 

E3(f4(D)),  E2(f4(T)))>  AnyEvent(f4(D))  } 
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because  even  though  E^(f4(D))  is  true,  there  is  no  instance  of  E5. 


Q.E.D. 


Theorem  3.17  (C-entailment  and  Circumscription) 

For  a  given  a  hierarchy  H,  a  statement  P  is  c-entailed  by  Q  if  and  only  if  P  follows  from 
the  following  schema: 

Q  a  Circum(  H  a  Circum(  Circum(  )» 

{AnyEvent}), 

H£-{End}) 

Note  that  the  two  inner  circumscriptions  apply  only  to  the  abstraction  hierarchy,  while 
the  final  circumscription  applies  to  all  of  H. 

Proof 

See  [Etherington  86]  for  proof  of  the  correctness  of  circumscription.  Since  the  result  of 
all  the  minimizations  can  be  described  by  a  finite  set  of  axioms,  we  strongly  suspect 
that  circumscription  is  complete  in  this  case.  (No  proof,  however,  has  appeared  in  the 
literature.) 

Q.E.D. 
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Appendix  B 
Chapter  4  Proofs 


Theorem  4. 1  (Minimum  Cardinality  Defaults) 

Consider  the  following  sequences  of  statements. 

MAq.  Vx  .  -iEnd(x) 

MAj.  Vx,y  .  End(x)  a  End(y)  =>  x=*y 
MA2-  Vx,y,z  .  End(x)  a  End(y)  a  End(z) 

3  (x=»y)  v  (x=z)  v  (y=z) 

•  •  • 

The  first  asserts  that  no  End  events  exist;  the  second,  no  more  than  one  End  event 
exists;  the  third,  no  more  than  two;  and  so  on.  Let  P  be  any  sentence,  T  any  sentence 
or  set  of  sentences,  and  H  a  hierarchy.  Suppose  there  is  a  minimum  covering  model  in 
which  the  extension  of  End  is  finite.  Then 
r  jj  Knc  Q 

if  and  only  if 

T  u  cl(H)  u  MAj  l-  Q 

where  i  is  the  smallest  integer  such  that  left-hand  side  of  the  provability  relation  is 
consistent 

Proof 

We'll  prove  that  Mi  is  a  minimum  cover  of  T  relative  to  H  (with  finite  extension  of 
End)  if  and  only  if  Mi  is  a  model  of  T  u  cl(H)  u  MAp  The  theorem  then  follows 
from  completeness  and  correctness  of  first-order  logic. 

Suppose  Mi  is  a  model  of  T  u  cl(H)  u  MAp  By  Theorem  3.14,  Mi  is  a  covering 
model.  Clearly  |Mi[End]|  £  i.  Suppose  that  Mi  were  defeated  for  minimum 
cardinality  in  End  by  M2.  Then  |M2[End]|  =  j  <  i.  But  by  Theorem  3.14  M2  is  a 
model  of  T  u  cl(H).  Because  |M2[End]|  =  j,  M2  is  also  a  model  of  MAj.  But  this  is 
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impossible,  because  we've  assumed  that  r  u  ci(H)  u  MAj  is  inconsistent.  So  Mi 
cannot  be  defeated,  and  must  be  a  minimum  cover. 

Suppose  Mi  is  such  a  minimum  cover,  where  |Mi[End]|  =  n.  As  before,  Mi  must  be 
model  of  T  u  cl(H).  Suppose  it  were  not  a  model  of  MAj.  Then  |Mi[End]|>i.  But 
because  T  u  cl(H)  u  MA*  is  consistent,  it  has  a  model  M2,  which  is  also  be  a 
covering  model,  with  iM2[End]i  £  i.  But  this  is  impossible,  because  M2  would 
defeat  M^  Therefore  MAj  must  also  be  true  in  M^ 

Q.E.D. 


Theorem  4.2  (Cardinality  Circumscription) 

Let  a  include  all  the  predicate,  function,  and  constant  symbols  in  our  language  other 
than  End.  Suppose  that  all  models  of  H  are  infinite,  and  in  some  model  of  Tu  cl(H), 
End  has  a  finite  extension.  If 

CircumOT  u  cl(H),  {End},  a)  1-  Q 

then 

I"  U  hmc  ^ 

where  CircumOT  u  cl(H),  {End},  a)  means  to  circumscribe  with  a  varying.  The  "if 
is  strengthened  to  "if  and  only  if’  if  it  is  true  that  circumscription  is  complete  in  this 
case. 
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Proof 


We  prove  that  Mi  is  a  minimum  covering  model  of  T  relative  to  H  if  and  only  if  Mj  is 
minimal  in  End  among  models  of  T  u  cl(H)  where  a  varies.  The  theorem  then 
follows  from  correctness  of  circumscription.  The  strengthened  version  of  the  theorem 
depends  upon  the  truth  of  the  proposition  that  circumscription  is  complete  when  the 
result  of  the  circumscription  is  equivalent  to  a  finite  set  of  first-order  formulas. 
Although  this  proposition  is  very  likely  to  be  true,  no  proof  has  yet  been  discovered 
[Etherington  87]. 

(if)  Suppose  Mi  is  a  minimum  covering  model  of  T.  By  Theorem  3.14  Mi  is  a  model 
of  T  u  cl(H).  Suppose  Mi  were  not  minimal  in  End  where  a  varies;  that  is,  there  is 
an  M2  such  that 

1.  M2  is  a  model  of  T  u  cl(H) 

2.  domain(Mi)  -  domain(M2) 

3.  M2[End]  cMifEnd] 

But  (1)  implies  M2  is  a  covering  model,  and  (2)  implies  that  |M2[End]|  <  |Mi[End]|,  so 
M2  would  defeat  Mi’s  candidacy  as  a  minimum  covering  model.  So  there  can  be  no 
such  M2,  and  Mi  is  minimal  in  End  where  a  varies. 

(only  if)  Suppose  Mi  is  minimal  in  End  among  models  of  T  u  cl(H)  where  a  varies. 
Suppose  Mi  is  not  a  minimum  covering  model  of  T.  Since  Mi  is  a  covering  model, 
there  must  be  an  M2  which  defeats  Mi's  candidacy  to  be  a  minimum  covering  model, 
with 

|M2[End]|  <  |Mi[End]| 

We  will  construct  a  model  which  defeats  Mi's  candidacy  for  minimality  in  End  where 
a  varies,  yielding  the  desired  contradiction.  But  M2  itself  is  not  suitable,  because  its 
domain  and  M^s  domain  may  differ,  and  may  not  even  be  of  the  same  size.  So 
consider  the  following  two  cases: 

1.  Suppose  |Domain(Mi)|  £  |Domain(M2)|.  Apply  the  Downward  Tarski  - 
Lowenheim-Skolem  Theorem  [Barwise  77]  and  let  M3  be  an  elementary 
submodel  of  M2  such  that  iDomain(Mi)|  =  |Domain(M3)|. 
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2.  Otherwise  |Domain(Mi)|  >  |Domain(M2)|.  Apply  the  Upward  Tarski- 
Ldwenheim-Skolem  Theorem  and  let  M3  be  an  elementary  extension  of  M2 
such  that  |Domain(Mi)|  =  [Domain(M3)|. 

Because  M3  is  an  elementary  submodel  or  extension,  and  M2  is  a  model  of  T  u  cl(H), 
M3  is  a  model  of  Tu  cl(H).  Furthermore,  we  have  assumed  that  |M2[End]|  is  finite, 
let's  say  of  size  n.  Then  the  sentence  MAn  is  true  in  M2,  and  therefore  also  in  M3. 
This  means  that  |M3[End]|  £  |M2[End]|.  Thus  M3  is  a  covering  model  which  defeats 
Mi's  candidacy  to  be  a  minimum  covering  model. 

Let  M4  be  a  homomorphism  of  M3  such  that  Domain (M4)=Domain(M  j ).  Now  we 
have  a  model  of  Tu  cl(H)  which  has  the  same  domain  as  Mi,  but  is  smaller  in  the 

size  of  its  extension  of  End.  But  we  need  a  model  where  the  extension  of  End  which  is 
a  subset  of  that  in  M^ 

Suppose 

Mi[End]  -  {  :Ai,  :A2,  ...  :Aj,  :Ci,  :C2, ...  :C fc,  :Ck+i, ...  } 

M4(End]  »  {  :Ai,  :A2, ...  :Aj,  :Bi,  :B2, ...  rB^  } 

That  is,  :Ai  through  :Aj  (where  possibly  j=0)  are  the  elements  in  End  that  Mj  and  M2 
have  in  common.  Elements  :Ci  and  up  are  in  End  only  in  Mi,  and  :B  1  through  rB^  are 
only  in  End  in  M4.  The  postfix  substitution  operator  {:A/:B}  replaces  all  instances  of 
:  A  by  instances  of  :B.  Let  8  be  the  substitution  which  swaps  all  the  :Bi  with  :Ci. 

0  =  (:Bi/:Ci, ...  tBic/tCjc,  :Ci/:Bi, ...  :Ck/:BjJ 
Define  M5  as  follows. 

M5[p]  »  M2[p]0  for  all  predicates  p 
M5(c]  »  M2[c]0  for  all  constants  c 
M5[f]  »  Xxi,x2, ...  xn  .  (M2[f](xi  0,  X2  0, ...  xn  0))0 
for  all  n-ary  functions  f 

M5  is  a  homomorphism  of  M4,  and  is  therefore  a  model  of  T  u  cl(H).  The  proper 

subset  condition  is  satisified  because 

MjlEnd]  *  {:Ai, ...  :Aj,  :Ci, ...  ’.C^}  c  Mi[End] 


7A-D-  162 


Because  cc  includes  all  symbols  other  than  End,  Mi  and  M5  do  not  have  to  agree  on  the 
interpretation  of  any  symbols.  So  M5  defeats  Mi's  candidacy  for  minimality  in  End 
where  ct  varies,  which  is  a  contradiction.  Therefore  Mi  is  a  minimum  covering  model 
ofr. 

Q.E.D. 
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Appendix  C 
Chapter  5  Proofs 


Theorem  5.1  (Non-Universal  Conclusions) 

Let  Q  be  a  sentence  which,  when  written  with  all  quantifiers  in  initial  position,  contains 
no  universal  quantifiers.  Then 

r  h  ^  ®cs  ^ 

if  and  only  if 

T  H  Knc  ^ 


Proof 

(Only  if)  Trivial,  since  a  minimum  cover  is  an  mcs-model. 

(If)  Q  can  be  written 

3  xi, ...  xn  .  <o 

where  ©  is  a  sentence  containing  no  quantifiers  in  which  x  j ,  ...  xn  appear  free. 
Suppose  Q  holds  in  all  minimum  covers.  Let  be  any  mcs-model.  Then  Mj 
contains  submodel  M2  in  which 
3  xi, ...  xn  .  © 

But  this  is  only  the  case  if  there  are 
:Ci, ...  :C0  €  Domain(M2) 

such  that  ©  is  true  in  M2(xi/:Ci,  ...  xn/:Cn}.  A  straightforward  inductive  argument 
shows  that  ©  is  true  in  Mifxj /:C\, ...  xn/:Cn}  because  all  terms  in  ©  are  interpreted 
by  Mi(xi/:Ci, ...  xn/:Cn}  as  individuals  in  the  domain  of  M2.  Therefore  Q  holds  in 
Mi,  and  in  all  mcs-models. 

Q.E.D. 
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Theorem  5.2  (Incremental  Recognition) 

In  the  case  of  a  single  observation,  imc-entailment  is  the  same  as  mcs-entailmenL 

(F l)  jj  ^imc  ^ 
if  and  only  if 

jj  ^ mcs  ^ 

In  the  case  of  multiple  observations,  imc-entailment  is  monotonic. 

(Fi,  •••  Fn.i)  ^imc  ^ 

implies 

(Ti, ...  rn_i,  Tn)  y  ^imc  ^ 

Proof 

The  first  part  of  the  theorem  follows  from  the  base  case  of  the  definition  of  imc- 
entailment  The  second  part  of  the  theorem  follows  from  the  fact  that  an  incremental 
minimum  cover  of  (Tj, ...  Tn_i,  Tn)  must  also  be  an  incremental  minimum  cover  of 
(Ti, ...  Tn_  i).  Therefore  if  Q  holds  in  all  models  of  the  latter  sort  it  must  also  hold  in 
all  models  of  the  former  sort 
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Appendix  D 

Temporal  Constraint  Logic 

The  following  rules  translate  Allen's  temporal  interval  operators  into  an  algebra 
on  fuzzy  interval  constraints.  A  time  interval  is  represented  by  a  term  which  is 
interpreted  as  a  pair  of  real  numbers.  The  functions  first-instant  and  last-instant 
apply  to  a  interval  and  yield  the  lower  and  upper  bounds  respectively.  A  fuzzy  interval 
constraint  is  a  tuple  of  four  real  numbers.  A  time  interval  satisfies  a  fuzzy  constraint, 
written 

I  e  (start-min,  start-max,  end- min.  end-max) 

if  the  first  instance  of  I  falls  between  start-min  and  start-max  inclusive,  and  the  final 
instance  of  I  falls  between  end-min  and  end-max  inclusion.  That  is, 

(start-min  £  first-instant(I)  £  start-max)  a 
(end-min  £  last-instant(I)  £  end-max) 

While  start-min,  start-max,  etc.,  are  represented  by  numerals,  first-instant(I)  and  last- 
instant(T)  remain  symbolic  quantities. 

The  form  of  each  each  rule  is  as  follows:  given  that  Tj  satisfies  fuzzy 
constraint  Z\,  and  T2  satisfies  fuzzy  constraint  Z2,  and  Tj  is  related  by  a  given  binary- 
operator  to  T2,  then  it  must  be  the  case  that  Ti  satisfies  fuzzy  constraint  Z3.  The 
logical  form  of  the  rules  is  as  follows,  where  Z3  =  FtransitiveCZj,  binary-op,  Z2). 

Where  Tj,  T2  are  time  intervals,  and  Z\X2  are  fuzzy  constraints: 

Tj  €  Z\  a  T2  e  Z2  a  Ti  binary-op  T2  r> 

Tj  6  Ftransitive(Zi,  binary-op,  Z2) 

Suppose  that  Ti  and  T2  are  related  by  a  disjunction  of  temporal  operators.  Then  one 
calculates  Z3  for  each  alternative,  and  then  combine  all  the  answers  with  the  Funion 
function,  which  is  defined  below.  In  formal  terms: 

Tie  Zi  a  T2  €  Z2  a  Ti  (opi  op2  ...  opn)  T2  3 
Ti  e  Funion(  Ftransitive(Zi,  opi,  Z2), 

Ftransitive(Zi,  op2,  Z2), 

...  ,  Ftransitive(Zi,  opn,  Z2) ) 
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Where  Ti  (opi  op2  ...  opn)  T2  a 

(Ti  opi  T2)  v  (Ti  op2  T2)  v  ...  v  (Ti  opn  T2) 

Consider  the  case  where  an  interval  is  known  to  satisfy  two  different  fuzzy  constraints. 
Then  it  must  be  the  case  that  the  interval  satisfies  the  Fintersection  of  the  constraints. 
Formally: 

Tj  6  Zj  a  Tj  e  Z2  3  Tj  6  Fintersection(  Z\,  Z2) 

Finally,  the  predicate  empty  is  true  of  a  fuzzy  constraint  just  in  case  no  interval  could 
satisfy  it 

The  following  tables  define  the  Funion,  Fintersection,  empty  and  Ftransitive 
functions.  Their  correctness  can  be  verified  by  elementary  algebra. 
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Ftransitive(  (a  b  c  d),  op,  (e  f  g  h) )  =  (w  x  y  z) 

op 

w 

X 

y 

z 

before 

a 

11 

c 

jkkpz&m 

meets 

a 

■BSH 

max(c,e) 

a 

WSS88SSKI 

ME3ESM 

starts 

WEBBBM 

■ES3S 

max(c,e) 

■SH 

during 

max(a,e) 

min(b,h) 

finishes 

max(a,e) 

min(b,h) 

min(d,h) 

min(b,h) 

d 

E 

HKESESfli 

PjBfiBSBI 

d 

after 

M35SPB1 

b 

BBSB3BI 

d 
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Appendix  E 
Transcripts 

Following  are  transcripts  of  the  implementation  running  on  some  of  the 
examples  discussed  in  this  thesis. 


Hunt/Rob  Example 


;;  First  observation:  get-gun(GET-GUNl).  Builds  graph  named  gl. 


Command:  (EXPLAIN-OBSER V ATION  '(GET-GUN  1  GET-GUN)  ’Gl) 
T 


;;  Display  e-graph  gl. 

;;  There  are  two  alternatives,  with  GET- GUN  1  as  component  of  rob-bank  or  of  hunt 
Command:  (DRAW-GRAPH) 
graph  Gl 
END226 

?*  HUNT228 
?*  HUNT227 

SI  ->  GETGUN1 
?-  ROB-BANK225 
?«  ROB-BANK224 
Sl->  GET-GUN  1  * 

The  e-graphs  are  printed  out  linearly,  where  the  symbol 
? m  represents  an  alternative  link  and  rolename~> 
represents  a  component  (and)  link.  The  symbol  A 
indicates  a  link  to  structure  already  printed  above.  This 
description  is  equivalent  to  the  following  diagram: 
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End(N226) 


Rob-Bank(N224)  Hunt(N227) 


Get-Gun(Nl) 


figure  E.1:  E-Graph  for  Get-Gun 

The  one  difference  from  the  algorithm  described  in 
Chapter  7  is  the  inclusion  of  nodes  which  "abstract 
away "  the  steps  of  any  node  which  has  explicit 
component  links.  In  this  example,  HUNT228  abstracts 
HUNT227,  but  doesn't  include  the  component  SI. 
These  extra  nodes  do  not  change  the  semantics  of  e- 
graphs,  but  they  do  allow  a  more  compact  representation 
of  the  graphs  when  the  type  to  be  explained  can  fill 
several  different  roles  in  a  user  type.  In  such  a  case ,  a 
different  user  node  is  created  for  each  use,  but  all  can 
become  alternatives  for  the  same  abstract  node  of  that 
user  type. 


;;  Turn  on  tracing. 

Command:  (SETQ  *NOISY*  T) 
T 


;;  Second  observation:  go-to-bank(GO-TO-BANK2),  in  graph  G2. 

Command:  (EXPLAIN-OBSERVATION  ’(GO-TO-BANK2  GO-TO-B ANK)  G2) 
...  creating  node  GO-TO-B  ANK2 
...  searching  up  from  GO-TO-B ANK2 
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...  creating  node  CASH-CHECK229 

...  considering  GO-TO-BANK2  as  SI  of  CASH-CHECK229 

...  checking  constraints  on  CASH-CHECK229 

...  CASH-CHECK229  is  okay 

...  searching  up  from  CASH-CHECK229 

...  creating  node  CASH-CHECK230 

...  searching  up  from  CASH-CHECK230 

...  creating  node  END23 1 

...  searching  up  from  END231 

...  creating  node  ROB-BANK232 

...  considering  GO-TO-BANK2  as  S2  of  ROB-BANK232 

...  checking  constraints  on  ROB-B  ANK232 

...  ROB-BANK232  is  okay 

...  searching  up  from  ROB-BANK232 

...  creating  node  ROB-BANK233 

...  searching  up  from  ROB-BANK233 

...  merging  graph  at  END23 1 

T 

;;  Show  g2,  again  two  alternatives:  rob-bank  or  cash-check. 

Command:  (DRAW-GRAPH) 

graph  G2 

END231 

?-  ROB-BANK233 
?«  ROB-BANK232 
S2->  GO-TO-BANK2 
?-  CASH-CHECK230 
?-  CASH-CHECK229 
SI  ->  GO-TO-BANK2  A 


;;  Matching  the  graphs  yields  just  the  rob-bank  alternative. 
Command:  (MATCH-GRAPHS  G1  G2  ’G1+G2) 
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...  trying  to  match  END226  with  END23 1 

...  creating  node  END234 

...  checking  constraints  on  END234 

...  END234  is  okay 

...  checking  constraints  on  END234 

...  END234  is  okay 

...  trying  to  match  HUNT228  with  ROB-BANK233 

...  trying  to  match  HUNT228  with  CASH-CHECK230 

...  trying  to  match  ROB-BANK225  with  ROB-BANK233 

...  creating  node  ROB-BANK235 

...  checking  constraints  on  ROB-BANK235 

...  ROB-BANK235  is  okay 

...  checking  constraints  on  ROB-BANK235 

...  ROB-B  ANK235  is  okay 

...  trying  to  match  ROB-BANK224  with  ROB-BANK232 

...  creating  node  ROB-B  ANK236 

...  checking  constraints  on  ROB-BANK236 

...  ROB-BANK236  is  okay 

...  creating  node  GO-TO-BANK237 

...  checking  constraints  on  GO-TO-BANK237 

...  GO-TO-BANK237  is  okay 

...  checking  constraints  on  GO-TO-BANK237 

...  GO-TO-B  ANK237  is  okay 

...  copying  GO-TO-BANK2  yields  GO-TO-BANK237 

...  creating  node  GET-GUN238 

...  checking  constraints  on  GET-CUN238 

...  GET-GUN238  is  okay 

...  checking  constraints  on  GET-GUN238 

...  GET-GUN238  is  okay 

...  copying  Cl  yields  GET-GUN238 

...  checking  constraints  on  ROB-BANK236 

...  ROB-BANK236  is  okay 

...  successful  match  of  ROB-B  ANK224  and  ROB-B  ANK232  yields 
ROB-BANK236 
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...  successful  match  of  ROB-BANK225  and  ROB-BANK233  yields 
ROB-BANK235 

...  trying  to  match  ROB-BANK225  with  CASH-CHECK230 
...  successful  match  of  END226  and  END231  yields  END234 
END234 


Command:  (DRAW-GRAPH) 

graph  G1+G2 

END234 

?*  ROB-BANK235 
?»  ROB-BANK236 

51  ->  GET-GUN238 

52  ->  GO-TO-B  ANK237 


Cooking  Examples 


;;  First  observation  is  make-sauce(OBS-SAUCE2),  with  agent  Joe,  during 
;;  time  interval  beginning  between  4  and  5,  and  ending  between  6  and  7. 

Note  that  fuzzy  time  constraints  are  represented  by 
VECTORS  with  TIME  as  first  element.  The  parameters 
of  the  observation  are  specified  by  a  list  of  (role  value) 
pairs.  The  first  use  of  the  symbol  "time”  below  refers  to 
the  role  Time,  while  the  second  merely  identifies  the 
vector  value  as  a  time  constraint. 


Command:  (EXPLAIN-OBSERVATION  '(OBS-SAUCE2  MAKE-SAUCE 

(AGENT  JOE)  (TIME  #(TIME  4  5  6  7)))) 

...  creating  node  OBS-SAUCE2 

...  searching  up  from  OBS-SAUCE2 

...  creating  node  MAKE-PASTA-DISH246 

...  considering  OBS-SAUCE2  as  S2  of  MAKE-PASTA-DISH246 

...  checking  constraints  on  MAKE-PASTA-DISH246 
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...  MAKE-PAST A-DISH2 46  is  okay 
...  searching  up  from  MAKE-PASTA-DISH246 
...  creating  node  MAKE-PASTA-DISH247 
...  searching  up  from  MAKE-PAST A-D IS H247 
...  creating  node  PREPARE-MEAL248 
...  searching  up  from  FREPARE-MEAL248 
...  creating  node  END249 
...  searching  up  from  END249 
...  creating  node  MAKE-MARINARA250 
...  checking  constraints  on  MAKE -MARIN  ARA250 
...  MAKE-MARINARA250  is  okay 
...  searching  up  from  MAKE-MARINARA250 
...  creating  node  MAKE-CHICKEN-MARINARA25 1 
...  considering  MAKE-MARINARA250  as  S5  of 
MAKE-CHICKEN-MARINARA25 1 

...  checking  constraints  on  MAKE-CHICKEN-MARINARA25 1 

...  MAKE-CHICKEN-MARINARA25 1  is  okay 

...  searching  up  from  MAKE-CHICKEN-MARINARA25 1 

...  creating  node  MAKE-CHICKEN-MARINARA252 

...  searching  up  from  MAKE-CHICKEN-MARINARA252 

...  creating  node  MAKE-MEAT-DISH253 

...  searching  up  from  MAKE-MEAT-DISH253 

...  merging  graph  at  PREP ARE-MEAL2 48 

T 

;;  View  the  graph. 

The  parameters  of  each  node  appear  in  a  list  of  role/value 
pairs  after  the  node  name.  Only  parameters  for  which  a 
value  has  been  calculated  are  included. 

Command:  (DRAW-GRAPH  T) 
graph  G2 
END249  NIL 

?»  PREPARE-MEAL248  ((AGENT  JOE)  (TIME  #(TIME  -INF  5  6  +INF))) 
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?-  MAKE-MEAT-DISH253 

((AGENT  JOE)  (TIME  #(TIME  -INF  5  6  +INF))) 

?»  MAKE-CHICKEN-MARINARA252 

((AGENT  JOE)  (TIME  #OTME  -INF  5  6  +INF))) 

?«  MAKE-CHICKEN-MARIN  ARA25 1 

((AGENT  JOE)  (TIME  #OTME  -INF  5  6  +INF))) 

S5  ->  MAKE-MARINARA250 

((AGENT  JOE)  (TIME  #(TIME  45  6  7))) 

?«  MAKE-PASTA-DISH247 

((AGENT  JOE)  (TIME  #(TIME  -INF  5  6  +INF))) 

?-  MAKE-PASTA-DISH246 

((AGENT  JOE)  (TIME  #(TIME  -INF  5  6  +INF))) 

S2  ->  0BS-SAUCE2  ((AGENT  JOE)  (TIME  #(TIME  4  5  6  7))) 


;;  The  second  observation  is  making  noodles. 


Command;  (EXPLAIN-OBSERVATION  ’(OBS-NOODLES3  MAKE-NOODLES 

(AGENT  JOE)  #(TIME  6  8  7  9))) 


T 


Command:  (DRAW-GRAPH  T) 
graph  G3 
END257  NIL 

?-  PREPARE-MEAL256 

((AGENT  JOE)  (TIME  #(TIME  -INF  8  7  +INF))) 

?*  MAKE-PASTA-DISH255 

((AGENT  JOE)  (TIME  #fITME  -INF  8  7  +INF))) 

?-  MAKE-PAST A-DISH254 

((AGENT  JOE)  (TIME  #(TIME  -INF  8  7  +INF))) 

SI  ->  OBS-NOODLES3  ((AGENT  JOE)  (TIME  #(TIME  6  8  7  9))) 


;;  We  can  merge  G2  and  G3  together  they  can  be  steps  of  the  same  action. 
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Command:  (MATCH-GRAPHS  'G2  G3  G2+G3) 

...  trying  to  match  END249  with  END257 

...  creating  node  END258 

...  checking  constraints  on  END 25  8 

...  END258  is  okay 

...  checking  constraints  on  END258 

...  END258  is  okay 

...  trying  to  match  PREPARE-MEAL248  with  PREPARE-MEAL256 

...  creating  node  PREP ARE-MEAL25 9 

...  checking  constraints  on  PREP ARE-MEAL25  9 

...  PREPARE-MEAL259  is  okay 

...  checking  constraints  on  PREPARE-MEAL25  9 

...  PREP ARE-MEAL25 9  is  okay 

...  trying  to  match  MAKE-MEAT-DISH253  with  MAKE-P ASTA-DISH25 5 

...  trying  to  match  MAKE-PASTAOISH247  with  MAKE-PASTA-DISH255 

...  creating  node  MAKE-P ASTA-DISH2 60 

...  checking  constraints  on  MAKE-P ASTA-DISH2 60 

...  MAKE-PASTA-DISH260  is  okay 

...  checking  constraints  on  MAKE-PAST A-DISH260 

...  MAKE-PASTA-DISH260  is  okay 

...  trying  to  match  MAKE-PAST  A— DISH246  with  MAKE-P  AST  A-DISH254 

...  creating  node  MAKE-PASTA-DISH261 

...  checking  constraints  on  MAKE-PASTA-DISH261 

...  MAKE-PASTA-DISH26 1  is  okay 

...  creating  node  MAKE-NOODLES262 

...  checking  constraints  on  MAKE-NOODLES262 

...  MAKE-NOODLES262  is  okay 

...  checking  constraints  on  MAKE-NOODLES262 

...  MAKE-NOODLES262  is  okay 

...  copying  OBS-NOODLES3  yields  MAKE-NOODLES262 

...  creating  node  MAKE-SAUCE263 

...  checking  constraints  on  MAKE-SAUCE263 

...  MAKE-SAUCE263  is  okay 

...  checking  constraints  on  MAKE-SAUCE263 
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...  MAKE-SAUCE263  is  okay 
...  copying  OBS-SAUCE2  yields  MAKE-SAUCE263 
...  checking  constraints  on  MAKE-PASTA-DISH26 1 
...  MAKE-P  ASTA-DISH26 1  is  okay 

...  successful  match  of  MAKE-P  ASTA-DISH246  and  MAKE-PAST A-DISH254 
yields  MAKE-P  ASTA-DISH260 

...  successful  match  of  MAKE-P ASTA-DISH247  and  MAKE-PASTA-DISH255 
yields  MAKE-PASTA-DISH26 1 

...  successful  match  of  PREPARE-MEAL248  and  PREP ARE-ME AL25 6  yields 
PREPARE-MEAL259 

...  successful  match  of  END249  and  END257  yields  END258 
END258 


Command:  (DRAW-GRAPH  T) 
graph  G2+G3 
END258  NIL 

?«  PREPARE-MEAL259  ((AGENT  JOE)  (TIME  #(TIME  -INF  5  7  +INF))) 
?«  MAKE-PASTA-DISH260 

((AGENT  JOE)  (TIME  #(TIME  -INF  5  7  +INF))) 

?«  MAKE-P ASTA-DISH26 1 

((AGENT  JOE)  (TIME  #(TIME  -INF  5  7  +INF))) 

S2  ->  MAKE-SAUCE263 

((AGENT  JOE)  (TIME  #(TEME  45  6  7))) 

SI  ->  MAKE-NOODLES262 

((AGENT  JOE)  (TIME  #(TIME  6  8  7  9))) 


;;  Now  consider  an  observation  of  make-noodles  with  a  different  agent.  The  constants 
;;  Joe  and  Sally  are  rigid  designators,  and  therefore  unequal. 


Command:  (EXPLAIN-OBSERVATION  ’(OBS-NOODLES4  MAKE-NOODLES 

(AGENT  SALLY)  #(TIME  6  8  7  9))) 


T 
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;;  Try  to  match  this  with  the  original  make-sauce.  It  will  fail,  because  agents  differ. 


Command:  (MATCH-GRAPHS  ’G2  G4  G2+G4) 

...  trying  to  match  END249  with  END267 

...  creating  node  END268 

...  checking  constraints  on  END268 

...  END268  is  okay 

...  checking  constraints  on  END268 

...  END268  is  okay 

...  trying  to  match  PREPARE-MEAL248  with  PREPARE-MEAL266 

...  creating  node  PREPARE-MEAL269 

...  equality  constraint  violated  JOE  *  SALLY 

...  PREPARE-MEAL269  fails 

...  END268  fails 

NIL 


;;  Lets  check  out  the  temporal  constraints,  now.  We'll  observe  a  boiling  event 
;;  with  time  BEFORE  the  make-noodles  event  This  conflict  will  prevent  a  match. 


Command:  (EXPLAIN-OBSERVATION  '(OBS-BOILING5  BOIL 

(TIME  #(TIME  l  1  2  2)))  ’G5) 


T 


Command:  (DRAW-GRAPH  T) 
graph  G5 
END273  NIL 

?-  PREPARE-MEAL272  ((TIME  #(TIME  -INF  1  2  +INF))) 

?»  MAKE-PAST A-DISH27 1  ((TIME  #(TIME  -INF  1  2  +INF))) 

?-  MAKE-PASTA-DISH270  ((TIME  #(TIME  -INF  1  2  +INF))) 
S3  ->  OBS-BOILING5  ((TIME  #(TIME  1  1  2  2))) 


;;  Try  to  match  this  with  G2  and  G3.  This  will  fail  because  of  temporal  constraint 
violation. 
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Command:  <N*ATCH-<3RAPHS  'G2+G3  G5  'G2+G3+G5) 

...  trying  to  match  END 25 8  with  END273 

...  creating  node  END274 

...  checking  constraints  on  END274 

...  END274  is  okay 

...  checking  constraints  on  END274 

...  END274  is  okay 

...  trying  to  match  PREPARE-MEAL259  with  PREPARE-MEAL272 

...  creating  node  PREP ARE-ME AL2 7 5 

...  checking  constraints  on  PREP  ARE-ME  AL275 

...  PREPARE-MEAL275  is  okay 

...  checking  constraints  on  PREP  ARE-ME  AL275 

...  PREPARE-MEAL275  is  okay 

...  trying  to  match  MAKE-PASTA-DISH260  with  MAKE-PASTA-DISH27 1 

...  creating  node  MAKE-PASTA-DISH276 

...  checking  constraints  on  MAKE-PAST A-DISH276 

...  MAKE-PASTA-DISH276  is  okay 

...  checking  constraints  on  MAKE-PASTA-DISH276 

...  MAKE-PA  ST  A-DISH276  is  okay 

...  trying  to  match  MAKE-PAST A-DISH26 1  with  MAKE-PAST A-DISH270 

...  creating  node  MAKE-PAST A-D I SH2 77 

...  checking  constraints  on  MAKE-PASTA-DISH277 

...  MAKE-PAST A-DISH277  is  okay 

...  creating  node  MAKE-NOODLES278 

...  checking  constraints  on  MAKE-NOODLES278 

...  MAKE-NOODLES278  is  okay 

...  checking  constraints  on  MAKE-NOODLES278 

...  MAKE-NOODLES278  is  okay 

...  copying  MAKE-NOODLES262  yields  MAKE-NOODLES278 

...  creating  node  MAKE-SAUCE279 

...  checking  constraints  on  MAKE-SAUCE279 

...  MAKE-SAUCE279  is  okay 

...  checking  constraints  on  MAXE-SAUCE279 
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...  MAKE-SAUCE279  is  okay 

...  copying  MAKE-SAUCE263  yields  MAKE-SAUCE279 

...  creating  node  BOIL280 

...  checking  constraints  on  BOIL280 

...  BOIL280  is  okay 

...  checking  constraints  on  BOIL280 

...  BOIL280  is  okay 

...  copying  OBS-BOILING5  yields  BOIL280 

...  checking  constraints  on  MAKE -PAST A-DISH2  77 

...  time  constraint  violated  #(TIME  6  8  7  9)  BEFOREMEET  #(TIME  1122) 

...  MAKE-PASTA-DISH277  fails 

...  MAKE-PASTA-DISH276  fails 

...  PREPARE-MEAL275  fails 

...  END274  fails 

NIL 


;;  Now  let's  find  a  LATER  boiling  event.  It  will  match  okay. 


Command;  (EXPLAIN-OBSERVATION  '(OBS-BOHJNG6  BOIL 

(TIME  #(TTME  9  10  11  12)))  ’G6) 


T 


Command:  (MATCH-GRAPHS  'G2+G3  'G6  ’G2+G3+G6) 
END285 


Command:  (DRAW-GRAPH  T) 
graph  G2+G3+G6 
END285  NIL 

?-  PREPARE-MEAL286  ((AGENT  JOE)  (TIME  #(TIME  -INF  5  1 1  +INF))) 
?=  MAKE-PASTA-DISH287 

((AGENT  JOE)  (TIME  #(TIME  -INF  5  1 1  +INF))) 

?-  MAKE-PAST  A-DISH288 
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((AGENT  JOE)  (TIME  #(TIME  -INF  5  1 1  +INF)) 
S3  ->  BOEL291  ((TIME  #(TIME  9  10  1 1  12))) 

S2  ->  MAKE-S  AUCE290 

((AGENT  JOE)  (TIME  #(TIME  45  6  7))) 
SI  ->  MAKE-NOODLES289 

((AGENT  JOE)  (TIN r  #(TIME  6  8  7  9))) 


Operating  System  Examples 


;;  User  enters:  %  copy  f  oo  bar. 


Command:  (EXPT  AJN-OBSERVATION  '(OBS-COPYl  COPY 

(OLD  FOO)  (NEW  BAR))  'Cl) 


T 


;;  Could  be  part  of  rename  by  copy,  or  of  modify  file. 


Command:  (DRAW-GRAPH  T) 
graph  Cl 
END294  NIL 

?*  RENAME297  ((NEW  BAR)  (OLD  FOO)) 

?=  RENAME-BY-COPY296  ((NEW  BAR)  (OLD  FOO)) 

?«  RENAME-BY-COPY295  ((NEW  BAR)  (OLD  FOO)) 

COFY-ORIG^STEP ->  OBS-COPYl  ((NEW  BAR)  (OLD  FOO)) 
?*  MODEFY293  ((FILE  FOO)) 

?-  MODEFY292  ((FILE  FOO)) 

BACKUP-STEP  ->  OBS-COPYl  ((NEW  BAR)  (OLD  FOO))  A 


;;  User  enters:  %  copy  jack  sprat. 


Command-  (EXPLAIN-OBSERVATION  '(OBS-COPY2  COPY 

(OLD  JACK)  (NEW  SPRAT))  ’C2) 
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T 

;;  Try  (and  fail)  to  unify  these  commands.  File  names  are  rigid  designators, 
;;  and  so  cannot  be  matched  unless  identical. 

Command:  (MATCH-GRAPHS  'Cl  ’C2  ’C1+C2) 

...  trying  to  match  END294  with  END30Q 

...  creating  node  END304 

...  checking  constraints  on  END304 

...  END304  is  okay 

...  checking  constraints  on  END304 

...  END304  is  okay 

...  trying  to  match  RENAME297  with  RENAME303 
...  creating  node  RENAME305 
...  equality  constraint  violated  BAR  *  SPRAT 
...  RENAME305  fails 

...  trying  to  match  RENAME297  with  MODEFY299 

...  trying  to  match  MODIFY293  with  RENAME303 

...  trying  to  match  MODIFY293  with  MODIFY299 

...  creating  node  MODEFY306 

...  equality  constraint  violated  FOO  *  JACK 

...  MODDFY306  fails 

...  END304  fails 

NIL 


;;  So,  there  must  be  two  different  plans  going  on. 
;;  User  enters:  %  delete  foo 


Command:  (EXPLAIN-OBSERVATION  '(OBS-DELETE3  DELETE 

(FILE  FOO))  ’C3) 


T 
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Command:  (DRAW-GRAPH  T) 
graph  C3 
END310  NIL 

?*  MODIFY312  NIL 
?»  MODIFY3 1 1  NIL 

DELETE-BACKUP-STEP  ->  OBS-DELETE3  ((FILE  FOO)) 
?«  RENAME309  ((OLD  FOO)) 

?-  RENAME-BY-COPY308  ((OLD  FOO)) 

?=  RENAME-B  Y-COPY307  ((OLD  FOO)) 

DELETE-ORIG-STEP  ->  OBS-DELETE3  ((FILE  FOO))  A 


;;  This  delete  can  unify  with  command  1,  but  not  command  2 


Command:  (MATCH-GRAPHS  'Cl  'C3  *C1+C3) 
END3I3 


Command:  (DRAW-GRAPH  T) 
graph  C1+C3 
END313  NIL 

?-  RENAME314  ((NEW  BAR)  (OLD  FOO)) 

?*  RENAME-B Y-C0PY3 15  ((NEW  BAR)  (OLD  FOO)) 

?-  RENAME-B Y-C0PY3 16  ((NEW  BAR)  (OLD  FOO)) 
DELETE-ORIG-STEP  ->  DELETE318  ((FILE  FOO)) 
COPY-ORIG-STEP  ->  COPY317  ((NEW  B  AR)  (OLD  FOO)) 


;;  Command  2  and  3  cannot  be  part  of  the  end  plan 

Command:  (MATCH-GRAPHS  'C2  'C3  C2+C3) 
...  trying  to  match  END300  with  END310 
...  creating  node  END321 
...  checking  constraints  on  END321 
...  END321  is  okay 
...  checking' constraints  on  END321 
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...  END321  is  okay 

...  trying  to  match  RENAME303  with  MODIFY312 
...  trying  to  match  RENAME303  with  RENAME309 
...  creating  node  RENAME322 
...  equality  constraint  violated  JACK  =*  FOO 
...  RENAME322  fails 

...  trying  to  match  MODIFY299  with  MODIFY3 12 

...  creating  node  MODIFY323 

...  checking  constraints  on  MODIFY323 

...  MODIFY323  is  okay 

...  checking  constraints  on  MODIFY323 

...  MODIFY323  is  okay 

...  trying  to  match  MODIFY298  with  MODEFY3 1 1 

...  creating  node  MODIFY324 

...  checking  constraints  on  MODIFY324 

...  MODIFY324  is  okay 

...  creating  node  COFY325 

...  checking  constraints  on  COPY325 

...  COPY325  is  okay 

...  checking  constraints  on  COPY325 

...  COPY325  is  okay 

...  copying  OBS-COPY2  yields  COPY325 

...  creating  node  DELETE326 

...  checking  constraints  on  DELETE326 

...  DELETE326  is  okay 

...  checking  constraints  on  DEUETE326 

...  DELETE326  is  okay 

...  copying  OBS-DELETE3  yields  DELETE326 

...  checking  constraints  on  MODIFY324 

...  equality  constraint  violated  FOO  -  SPRAT 

...  MODIFY324  fails 

...  MODIFY324  fails 

...  MODIFY323  fails 

...  trying  to  match  MODIFY299  with  RENAME309 
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...  END321  fails 
NIL 


Language  Examples 

;;  Joe  says  to  Sally:  "Can  you  give  me  the  salt?' 

Command:  (EXPLAIN-OBSERVATION  '(TALK1  SURFACE-QUESTION 

(SPEAKER  JOE) 

(HEARER  SALLY) 

(TIME  #CTTME  445  5)) 

(CONTENT  (CAN  SALLY  (GAVE  SALLY  JOE  SALT)))) 
TJTTER1) 

...  creating  node  TALK1 

...  searching  up  fromTALKl 

...  creating  node  DIRECT-REQUEST360 

...  considering  TALK1  as  S  of  DIRECT-REQUEST360 

...  checking  constraints  on  DIRECT-REQUEST360 

...  DIRECT-REQUEST360  Is  okay 

...  searching  up  from  DDRECT-REQUEST360 

...  creating  node  DIRECT-REQUEST36 1 

...  searching  up  from  DIRECT-REQUEST361 

...  creating  node  REQUEST362 

...  searching  up  from  REQUEST362 

...  creating  node  OBTAIN-BY-ASKING363 

...  considering  REQUEST362  as  SI  of  OBTAIN-BY-ASKING363 

...  checking  constraints  on  OBTAIN-BY-ASKING363 

...  equality  constraint  violated  GAVE  ■  INFORMED  IF 

...  OBTAIN-BY-ASKING363  fails 

...  OBTAIN-BY-ASKING363  fails 

...  creating  node  FTNDOUT-BY-ASKING364 

...  considering  REQUEST362  as  SI  of  HNDOUT-BY-ASKING364 


7A-D-  185 


...  checking  constraints  on  F1NDOUT-BY-ASKING364 

...  FINDOUT-BY-ASKING364  is  okay 

...  searching  up  from  FINDOUT-BY-ASKING364 

...  creating  node  FINDOUT-B  Y-ASKING365 

...  searching  up  from  FINDOUT-BY-ASKING365 

...  creating  node  FINDOUT366 

...  searching  up  from  FINDOUT366 

...  creating  node  END367 

...  searching  up  from  END367 

...  creating  node  INDIRECT-REQUEST368 

...  considering  TALK1  as  S  of  INDIRECT-REQUEST368 

...  checking  constraints  on  INDIRECT-REQUEST368 

...  INDERECT-REQUEST368  is  okay 

...  searching  up  from  INDIRECT-REQUEST368 

...  creating  node  INDIRECT-REQUEST369 

...  searching  up  from  INDIRECT-REQUEST369 

...  creating  node  REQUEST370 

...  searching  up  from  REQUEST370 

...  creating  node  OBTAIN-BY-ASKING371 

...  considering  REQUEST370  as  Si  of  OBTAIN-B Y-ASKING371 

...  checking  constraints  on  OBTAIN-B  Y-ASKING37 1 

...  OBTAIN-B Y-ASKING371  is  okay 

...  searching  up  from  OBTAIN-B  Y-ASKING371 

...  creating  node  OBTAIN-B  Y-ASKING372 

...  searching  up  from  OBTAIN-B  Y-ASKING372 

...  creating  node  OBTAIN373 

...  searching  up  from  OBTAIN373 

...  merging  graph  at  END367 

...  creating  node  FINDOUT-BY-ASK1NG374 

...  considering  REQUEST370  as  SI  of  FINDOUT-B Y-ASKING374 

...  checking  constraints  on  FINDOUT-B Y-ASKING374 

...  equality  constraint  violated  INFORMEDIF  =■  GAVE 

...  FINDOUT-B Y-ASKING374  fails 

...  FINDOUT-B  Y-ASK1NG374  fails 
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T 

;;  The  statement  is  ambiguous,  as  we  see: 

Command:  (DRAW-GRAPH  T) 
graph  UTTER  1 
END367  NIL 
?«  OBTAIN373 

((OBJECT  SALT) 

(AGENT  JOE) 

(TIME  #(TTME  -INF  4  5  +INF))) 

(PRETIME  #(TIME  -INF  4  -INF  +INF)) 

?-  OBTAIN-BY-ASKING372 

((AGENT  JOE)  (OBJECT  SALT) 

(TIME  #(T1ME  -INF  4  -INF  +INF)) 

(PRETIME  #(TIME  -INF  4  -INF  +INF))) 

?-  OBTAIN-BY-ASKING371 

((AGENT  JOE)  (OBJECT  SALT) 

(TIME  #(TIME  -INF  4  -INF  +INF)) 

(PRETIME  #(TIME  -INF  4  -INF  +INF))) 

SI  ->  REQUEST370 

((REQGOAL  (GAVE  SALLY  JOE  SALT)) 
(HEARER  SALLY) 

(SPEAKER  JOE) 

(TIME  #(TIME  4  4  5  5))) 

?-  INDIRECT-REQUEST369 

((SPEAKER  JOE)  (HEARER  SALLY) 
(REQGOAL  (GAVE  SALLY  JOE  SALT)) 
(TIME  #(TIME  4  4  5  5))) 

?*  INDIRECT-REQUEST368 
((SPEAKER  JOE) 

(HEARER  SALLY) 

(REQGOAL  (GAVE  SALLY  JOE  SALT)) 
(TIME  #(TIME  4  4  5  5))) 
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S->  TALK1 

((CONTENT  (CAN  SALLY 

(GAVE  SALLY  JOE  SALT))) 

(HEARER  SALLY) 

(SPEAKER  JOE) 

(TIME  #(TIME  4  4  5  5))) 

?■  FINDOUT366 

((INFO  (CAN  SALLY  (GAVE  SALLY  JOE  SALT))) 

(AGENT  JOE) 

(PRETIME  #(T1ME  -INF  4  -INF  +INF)) 

(TIME  #(TIME  -INF  4  5  +INF))) 

?-  FINDOUT-BY-ASKING365 
((AGENT  JOE) 

(INFO  (CAN  SALLY  (GAVE  SALLY  JOE  SALT))) 

(TIME  #(TIME  -INF  4  5  +INF)) 

(PRETIME  #(TIME  -INF  4  -INF  +INF))) 

?»  FINDOUT-BY-ASKING364 
((AGENT  JOE) 

(INFO  (CAN  SALLY  (GAVE  SALLY  JOE  SALT))) 

(TIME  #(TIME  -INF  4  5  +INF)) 

(PRETIME  #(TIME  -INF  4  -INF  +INF))) 

Sl->  REQUEST362 

((REQGOAL  ( INFORMED! F  SALLY  J(  )E 

(CAN  SAIL  (GAVE  SALLY  JOE  SALT))) 

(HEARER  SALLY) 

(SPEAKER  JOE) 

(TIME  #(TIME  445  5))) 

?»  DIRECT-REQUEST361 

((SPEAKER  JOE) 

(HEARER  SALLY) 

(REQGOAL  (INFORMEDIF  SALLY  JOE 

(CAN  SALLY  (GAVE  SALLY  JOE  SALT)))) 
(TIME  #(TIME  4  4  5  5))) 

?=  DIRECT-REQUEST360 
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((SPEAKER  JOE) 

(HEARER  SALLY) 

(REQGOAL  (INFORMEDIF  SALLY  JOE 

(CAN  SALLY  (GAVE  SALLY  JOE  SALT)))) 
(TIME  #(TIME  445  5))) 

S ->  TALK1 

((CONTENT  (CAN  SALLY 

(GAVE  SALLY  JOE  SALT))) 
(HEARER  SALLY) 

(SPEAKER  JOE) 

(TIME  #(TIME  4  4  5  5))) 

;;  There  are  2  (of  4)  alternatives  not  eliminated:  an  attempt  to  find  out  if  Sally 
;;  can  give  Joe  the  salt,  or  an  attempt  to  obtain  the  salt 

;;  Now  lets  add  the  fact  that  at  all  times,  Joe  knows  if  Sally  can  give  him  the  salt 

The  implementation  did  not  include  an  explicit  Holds 
predicate;  instead,  it  included  a  fuzzy  time  constraint  as 
the  second  argument  to  fact  predicates.  The  following 
assertion  means 

V  t  €  (— oo  — oo  4.00  +00) 

Holds(t  (knowif  Joe  (can  Sally  (gave  Sally  Joe  Salt)))) 

Command:  (ADD-FACT  ’(KNOWIF  #(TIME  :-lNF  :-INF  :+INF  :+IND  JOE 
(CAN  SALLY  (GAVE  SALLY  JOE  SALT))) 

T 

;;  Repeat  the  example.  Joe  says  to  Sally:  "Can  you  give  me  the  salt'?" 

Command  (EXPLAIN-OBSERVATION  '(TALK2  SURFACE-QUESTION 

(SPEAKER  JOE)  (HEARER  SALLY) 

(CONTENT  (CAN  SALLY  (GAVE  SALLY  JOE  SALT)))) 
’UTTER2) 

...  creating  node  TALK2 
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...  searching  up  from  TALK2 

...  creating  node  DERECT-REQUEST375 

...  considering  TALK2  as  S  of  DERECT-REQUEST375 

...  checking  constraints  on  DIRECT-REQUEST375 

...  D1RECT-REQUEST375  is  okay 

...  searching  up  from  DIRECT-REQUEST375 

...  creating  node  DERECT-REQUEST376 

...  searching  up  from  DERECT-REQUEST376 

...  creating  node  REQUEST377 

...  searching  up  from  REQUEST377 

...  creating  node  OBTAIN-BY-ASKING378 

...  considering  REQUEST377  as  SI  of  OBTAIN-BY-ASKING378 

...  checking  constraints  on  OBTAIN-BY-ASKING378 

...  equality  constraint  violated  GAVE  » INFORMEDEF 

...  OBTAIN-BY-ASKING378  fails 

...  OBTAIN-BY-ASKING378  fails 

...  creating  node  FINDOUT-BY-ASKING379 

...  considering  REQUEST377  as  SI  of  FINDOUT-BY-ASKING379 

...  checking  constraints  on  FINDOUT-BY-ASK1NG379 

...  fact  constraint  violated  (NEVER  #(TIME  -INF  4  -INF  +INF)  KNOWIF  JOE 

(CAN  SALLY  (GAVE  SALLY  JOE  SALT))) 

...  FINDOUT-BY-ASKING379  fails 

...  REQUEST377  fails 

...  DERECT-REQUEST376  fails 

...  DIRECT-REQUEST375  fails 

...  creating  node  INDIRECT-REQUEST3 80 

...  considering  TALK2  as  S  of  INDIRECT-REQUEST380 

...  checking  constraints  on  INDIRECT-REQUEST380 

...  INDIRECT-REQUEST380  is  okay 

...  searching  up  from  INDIRECT-REQUEST380 

...  creating  node  INDIRECT-REQUEST 3 8 1 

...  searching  up  from  INDIRECT-REQUEST381 

...  creating  node  REQUEST382 

...  searching  up  from  REQUEST382 
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...  creating  node  OBTAIN-BY-ASKING383 

...  considering  REQUEST382  as  SI  of  OBTAIN-BY-ASKING383 

...  checking  constraints  on  OBTAIN-BY-ASKING383 

...  OBTAIN-BY-ASKING383  is  okay 

...  searching  up  from  OBTAIN-B  Y-ASKING383 

...  creating  node  OBTAIN-BY-ASKING384 

...  searching  up  from  OBTAIN-B  Y-ASKING384 

...  creating  node  OBTAIN385 

...  searching  up  from  OBTAIN385 

...  creating  node  END386 

...  searching  up  from  END386 

...  creating  node  FINDOUT-BY-ASKING387 

...  considering  REQUEST382  as  SI  of  FINDOUT-BY-ASKING387 

...  checking  constraints  on  FINDOUT-BY-ASKING387 

...  equality  constraint  violated  INFORMED  IF  »  GAVE 

...  FINDOUT-BY-ASKING387  fails 

...  FINDOUT-BY-ASKING387  fails 

T 


;;  The  only  interpretation  is  the  indirect  request 


Command:  (DRAW-GRAPH  T) 
graph  UTTER2 
?»  OBTAIN385 

((OBJECT  SALT) 

(AGENT  JOE) 

(TIME  #(TIME  -INF  4  5  +INF))) 

(PRETIME  #(TIME  -INF  4  -INF  +INF)) 

?-  OBTAIN-BY-A,SKING384 

((AGENT  JOE)  (OBJECT  SALT) 

(TIME  #(TIME  -INF  4  -INF  +INF)) 
(PRETIME  #(TIME  -INF  4  -INF  +INF))) 
?»  OBTAIN-BY-ASKING383 

((AGENT  JOE)  (OBJECT  SALT) 
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(TIME  #(T1ME  -INF  4  -INF  +INF)) 

(PRETIME  #(TIME  -INF  4  -INF  +INF))) 

SI  ->  REQUEST382 

((REQGOAL  (GAVE  SALLY  JOE  SALT)) 
(HEARER  SALLY) 

(SPEAKER  JOE) 

(TIME  #(TIME  4  4  5  5))) 

?« INDIRECT-REQUEST3  8 1 

((SPEAKER  JOE)  (HEARER  SALLY) 
(REQGOAL  (GAVE  SALLY  JOE  SALT)) 
(TIME  #(TIME  44  5  5))) 

?-  INDERECT-REQUEST380 
((SPEAKER  JOE) 

(HEARER  SALLY) 

(REQGOAL  (GAVE  SALLY  JOE  SALT)) 
(TIME  #(TIME  445  5))) 

S ->  TALK2 

((CONTENT  (CAN  SALLY 

(GAVE  SALLY  JOE  SALT))) 
(HEARER  SALLY) 

(SPEAKER  JOE) 

(TIME  #(TIME  4  4  5  5))) 
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Appendix  F 
Details  of  Correctness 
Proof  of  Explain 

The  key  to  the  correctness  of  explain  -  the  fact  that  all  possible  interpretations 
are  considered  -  lies  in  the  interaction  of  the  redundant  subroutine  and  the  definition 
of  the  set  Uses.  Redundant  blocks  consideration  of  uses  of  an  event  which  are 
covered  by  other,  more  apt  uses.  For  example,  if  the  primary  event  to  be  explained  is 
of  type  MakeMaxinara,  the  use  (MakeSauce,  s2,  MakePastaDish)  is  redundant,  while  if 
the  primary  event  to  be  explained  is  of  type  MakeSauce,  the  use  (MakeMarinara,  s2, 
MakeSpaghetaMarinara)  is  redundant. 

Following  is  the  statement  which  must  be  shown  to  hold  in  order  for  explain 
to  be  correct 

Statement  of  the  Problem 

Consider  the  invocation  ' 

explain(  Eo,  D,  0,  true,  Eq) 

For  every  (event  type)  E«  such  that  Eu  has  an  r-direct  component  of  type  Ec  which  is 
compatible  with  Eo,  and  for  every  basic  specialization*  Ej,  of  Eu  whose  instances  could 
possibly  have  an  r-component  of  Eo: 

the  algorithm  creates  a  node  Ea(ng)  such  that  Eg  abstracts*  Et>,  whose  r- 
component  Eg(ng)  is  a  node  of  type  compatible  with  Eo- 

Multiple  Inheritance  Proof 

By  inspection  we  see  that  explain  does  create  a  node  Ec(nc),  which  holds  one  of  the 
following  relations  to  Eo(no). 
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Ec(nc) 


EO(nO) 


or 


Ec(nc) 


Es(ns) 


EO 

where 

Es 


figure  F.1:  Relation  ofEc(nc)  to  Eo(no) 


If  the  component/use  inference  step  performs  a  recursive  call  for  use  (Ec,  r,  Eu),  then 
the  condition  is  satisfied  by  Eu(ntt)  itself.  If  this  doesn’t  occur,  then  the  redundant 
test  must  have  blocked  the  use.  So  consider  the  ways  in  which  redundant  could  be 
satisfied 


Case  L  (Ec,  r,  EJ  abstracts  a  use  for  a  type  which  abstracts*  primary.  There  .  .ust 
be  a  least  abstract  such  type,  call  it  Ei,  with  use  (Ei,  r,  Ed).  This  is  illustrated  below. 


figure  F 2:  Case  1 

When  explain  visited  Ei(ni)  it  must  have  considered  all  r-uses  of  Et,  since  by 
inspection  redundant  could  not  block  them.  Now  we  claim  that  there  are  r-uses 
which  fulfil  the  condition. 
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i 


Let  Eb  be  the  specialization  of  Eu  in  question.  Consider  the  generation  of  Uses.  Let  Ea 
be  the  most  general  abstraction*  of  Eb  such  that: 

L  All  specializations*  of  Ea  could  have  an  r-th  component  of  type  Ei 
ii.  (Ei,  r,  Ea)  6  Ua 

By  definition  of  Ea,  (Ei,  r,  Ea)  appears  in  Uses.  Thus  node  Ea(na),  with  roleval 
(r,  ni),  is  exactly  the  node  needed  for  the  proof. 

Case  2.  (Ec,  r,  Ett)  specializes  a  use  for  a  type  which  specializes*  primary.  There 
must  be  a  most  general  such  type,  call  it  Ej,  with  use  (Ej,  r,  Ea).  This  is  illustrated 
below. 


EO 


figure  F 3:  Case  2 

By  inspection  we  see  that  the  redundant  test  cannot  block  this  use,  so  node  Ea(na)  is 
the  desired  node,  because  Eu  abstracts*  Eb  and  Ea  abstracts*  Eu. 

Case  3.  (Ec,  r,  Eu)  specializes  a  use  for  a  type  which  abstracts  Eo  and  Ec  does  not 
abstract*  or  specialize*  Eo  There  must  be  a  least  general  such  type,  call  it  Ej,  with  use 
(Ei,  r,  Eu).  Let  E2  be  the  least  general  abstraction*  of  Eo  which  has  a  use  (E2,  r,  £f) 
which  specializes*  (Ei,  r,  Eu).  (Possibly  E2»Ei.)  This  is  illustrated  below. 
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figure  F. 4:  Case  3 

Let  Ea  be  the  most  general  abstraction*  of  Eb  such  that 

i  AH  specializations*  of  Ea  could  possibly  have  an  r-th  component  of 
typeE2. 

ii.  (E2,  r,  Ea)  €  U« 

By  definition  of  Ea,  (E2,  r,  Ea)  appears  in  Uses.  By  inspection  we  sec  that  this  use 
cannot  be  blocked  by  redundant,  so  Ea(na),  with  roleval  (r,  n2),  is  the  desired  node. 


Single  Inheritance  Proof 

Consider  the  single-inheritance  version  of  redundant.  There  are  two  conditions  under 
which  redundant  returns  true: 

Case  1.  (etype,  r,  utype)  abstracts  a  use  for  some  member  of  visited.  Because 
abstractions  of  primary  are  considered  before  specializations  of  primary,  it  must  be 
the  case  that  (etype,  r,  utype)  abstracts  a  use  for  a  type  which  abstracts*  primary:  that 
is,  the  first  case  above. 
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Case  2.  (etype,  r,  utype)  specializes  a  use  for  some  member  of  visited.  Because 
visited  is  grown  depth  first,  it  must  be  the  case  that  (etype,  r,  utype)  specializes  a  use 
for  a  type  which  specializes*  primary:  that  is,  the  second  case  above. 

This  completes  the  details  of  the  proof  of  correctness  of  explain. 
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Abstract 

In  this  paper  we  present  a  semantic  model  that  is 
used  to  interpret  a  logic  that  represents  con¬ 
current  actions  having  temporal  extent.  In  an 
earlier  paper  (Pelavin  and  Allen,  1986]  we 
described  how  this  logic  is  used  to  formulate 
planning  problems  that  involve  concurrent 
actions  and  external  events.  In  this  paper  we 
focus  on  the  semantic  structure.  This  structure 
provides  a  basis  for  describing  the  interaction 
between  actions,  both  concurrent  and  sequential, 
and  for  composing  simple  actions  to  form  com¬ 
plex  ones.  This  model  can  also  treat  actions  that 
are  influenced  by  properties  that  hold  and  events 
that  occur  during  the  time  that  the  action  is  to 
be  executed.  Each  model  includes  a  set  of 
world-histories,  which  are  complete  worlds  over 
time,  and  a  function  that  relates  world-histories 
that  differ  solely  on  Ike  account  of  an  action  exe¬ 
cuted  at  a  particular  time.  This  treatment 
derives  from  the  semantic  theories  of  conditionals 
developed  by  Stalnaker  [Stalnaker,  1968)  and 
Lewis  [Lewis,  1973). 

I.  Introduction 

One  of  the  most  successful  approaches  to  representing 
events  and  their  effects  in  Artificial  Intelligence  has 
been  situation  calculus  [McCarthy  and  Hayes, 1969]. 
In  this  logic,  an  event  is  modeled  by  a  function  from 
situation,  i.e.  instantaneous  snapshot  of  the  world,  to 
situation.  This  function  captures  the  state  changes 
produced  by  the  event  in  different  situations. 

A  deficiency  of  this  representation  is  that  simul¬ 
taneous  events  cannot  be  directly  modeled;  one  cannot 
describe  the  result  produced  by  two  events  initiated  in 
the  same  situation  (see,  however,  Georgeff  [Georgeff, 
1986]  who  extends  and  modifies  situation  calculus  so 
this  can  be  done).  Another  deficiency  is  that  situation 
calculus  does  not  capture  what  is  happening  while  an 
event  is  occurring.  Thus,  one  cannot  directly  treat 
events  that  are  affected  by  conditions  that  hold  during 
execution,  such  as  the  event  "sailing  across  the  lake 
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which  can  occur  only  if  the  wind  is  blowing  while  the 
sailing  is  taking  place. 

Allen  [Allen,  1984]  and  McDermott  [McDermott, 
1982)  have  put  forth  logics  that  represent  simultane¬ 
ous  events  and  events  with  temporal  extent.  Allen 
develops  a  linear  time  model  based  on  intervals,  i.e. 
contiguous  chunks  of  time.  McDermott  describes  a 
branching  time  model  where  a  set  of  instantaneous 
states  are  arranged  into  a  tree  that  branches  into  the 
future.  McDermott  uses  the  term  "interval"  to  refer  to 
a  convex  set  of  states  that  lie  along  a  branch  in  the 
tree  of  states. 

In  both  logics,  an  event  is  equated  with  the  set  of 
intervals  over  which  the  event  occurs.  Properties, 
which  capture  static  aspects  of  the  world,  are  treated 
in  a  similar  manner.  Each  property  is  equated  with 
the  set  of  intervals  over  which  the  property  holds. 
Simultaneous  events  can  be  described  by  stating  that 
two  events  occur  over  intervals  that  overlap  in  time. 
One  can  also  describe  the  properties  that  hold  and  the 
events  that  occur  while  some  event  takes  place. 

Although  these  logics  overcome  some  of  the 
deficiencies  of  situation  calculus,  they  are  not  ade¬ 
quate  for  reasoning  about  actions  and  forming  plans. 
These  logics  lack  a  structure  analogous  to  the  result 
function  in  situation  calculus  that  describes  the  result 
of  executing  different  actions  in  different  contexts.  In 
situation  calculus,  the  context  is  given  by  the  situation 
in  which  an  action  is  to  be  initiated.  At  each  situa¬ 
tion,  one  can  describe  whether  an  action  can  be  suc¬ 
cessfully  executed  and  describe  whether  an  action 
negates  some  property  or  does  not  affect  it.  This  struc¬ 
ture  also  provides  a  simple  basis  for  constructing  com¬ 
plex  actions,  i.e.  sequences  of  actions. 

Without  extension,  similar  statements  cannot  be 
made  in  Alien’s  and  McDermott’s  logics.  For  example, 
one  cannot  describe  that  an  action  does  not  affect 
some  property  or  event,  such  as  stating  that  raising 
one's  arm  does  not  affect  whether  it  is  raining  out. 
One  cannot  represent  that  some  action  can  be  exe¬ 
cuted  only  under  certain  conditions,  such  as  stating 
that  the  agent  can  edit  a  document  during  interval  i 
only  if  the  text  editor  is  operational  during  i. 

Allen’s  and  McDermott's  logics  do  not  provide  a 
basis  for  relating  the  conditions  under  which  a  set  of 
actions,  concurrent  or  sequential,  can  be  executed 
together  to  the  conditions  under  which  the  actions, 
making  up  the  set,  can  be  executed  individually. 
Whether  two  actions  can  be  executed  together 
depends  on  how  they  interact.  For  example,  one  ®ajr 
be  able  to  execute  two  actions  individually,  but  not 
concurrently,  Such  as  "moving  one’s  hand  up"  and 
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“moving  one’s  hand  down”.  It  might  be  the  case  that 
two  actions  can  be  executed  together  only  under  cer¬ 
tain  conditions,  such  as  two  concurrent  actions  that 
share  the  same  type  of  resource.  Allen’s  and 
McDermott’s  logics  can  express  ’If  actions  al  and  a2 
both  occur  during  i,  then  there  must  be  at  least  two 
resources  available  during  i”.  These  logics,  however, 
cannot  distinguish  whether  "there  are  at  least  two 
resources  available  during  i"  is  a  necessary  condition 
that  must  hold  in  order  to  execute  al  and  a2  together, 
or  whether  this  condition  is  an  effect  produced  by  the 
joint  execution  of  al  and  a2.  A  detailed  discussion  of 
these  issues  is  given  in  [Pelavin,  1987]. 1 

To  remedy  these  problems,  we  develop  a  semantic 
model  that  contains  a  structure  analogous  to  the 
result  function  in  situation  calculus.  In  our  models, 
world -histories  and  action  instances  take  the  place  of 
situations  and  actions.  A  world-history  refers  to  a 
complete  world  over  time,  rather  than  an  instantane¬ 
ous  snapshot.  An  action  instance,  refers  to  an  action 
to  be  performed  at  a  specified  time.  A  world-history 
serves  as  the  context  in  which  the  execution  of  an 
action  instance  is  specified.  This  enables  us  to  model 
the  influence  of  conditions  that  may  hold  during  the 
time  that  an  action  instance  is  to  be  executed,  and,  as 
we  will  see,  provides  a  simple  basis  for  modeling  con¬ 
current  interactions  and  for  defining  the  joint  execu¬ 
tion  of  a  set  of  action  instances. 

To  describe  these  models,  we  extend  Allen's 
language,  which  is  a  first  order  language,  with  two 
modal  operators.  In  this  paper,  we  only  describe  the 
underlying  semantic  structure  and  do  not  discuss  the 
syntax  or  interpretation  of  this  modal  language. 
Moreover,  we  focus  on  the  portion  of  the  model  that 
pertains  to  modeling  actions,  after  briefly  describing 
the  other  components  in  the  model  structure.  The 
reader  interested  in  the  language,  axiomatics,  or  other 
details  omitted  in .  this  paper  can  refer  to  JPelavin, 
1987]  and  (Pelavin  and  Allen,  1988]. 

II.  Overview  of  the  model  structure 

In  each  model,  a  set  of  world-histories  and  set  of  tem¬ 
poral  intervals  are  identified.  Each  temporal  interval 
picks  out  a  common  time  across  the  set  of  world- 
histories.  The  intervals  are  arranged  by  the  MEETS 

vA&'SS,.,1®  !°rm  a  8lobal  date  1'ne-  The  relation 
.  1EETS(i1,i2)  is  true  if  interval  il  is  immediately  prior 
to  interval  it.  In  (Allen  and  Hayes,  1985],  it  is  shown 
that  all  temporal  interval  relations,  such  as  "overlaps 

MEETSr'ght  aDd  starts"  can  be  defined  terms  of 

The  model  identifies  the  set  of  properties  and 
events  that  hold  (occurl  at  various  times  in  the 
<Ji  tie  rent  world- histories,  formally,  events  and  proper¬ 
ties  are  sets  of  ordered  pairs,  each  formed  by  an  inter¬ 
val  and  a  world-history.  If  <i,h>  G  ev,  then  event  eti 
occur,  during  interval  »  in  world-history  h.  Similarly, 

.  <■>“>  G  pr,  then  property  pr  holds  during  interval  » 
in  world-history  h.  To  capture  the  relation  "if  pro¬ 
perty  pr  holds  over  an  interval  i  then  pr  holds  over  all 
intervals  contained  in  ",  we  restrict  the  models  so 

1  For  example,  in  (Pelavin,  1987]  we  ihow  why  a  branching 
tune  model  cannot  be  uied  to  interpret  "action  *1  can  be 
executed  during  time  “  ir  we  want  to  treat  action*,  ruch  a* 
sailing",  that  are  influenced  by  condition*  that  hold  Hnrlr,. 
execution. 


that  if  il  is  contained  in  it  and  <i2,h>  G  pr,  then 
<il,h>  G  pr. 

World-histories  are  arranged  into  trees  that 
branch  into  the  future  by  the  R  accessibility  relation 
which  takes  an  interval  and  two  world-histories  as 
arguments.  Intuitively,  Rn,bl,h2)  means  that  hi  and 
ht  share  a  common  past  through  the  end  of  interval  » 
and  are  possible  with  respect  to  each  other  at  ».  This 
structure  is  identical  to  one  found  in  [Haas,  1985]  with 
the  exception  that  Haas  uses  a  time  point  to  relate 
world-theories,  rather  than  the  end  of  an  interval. 
Constraints  are  placed  on  R  to  insure  that  il  it  is  an 
equivalence  relation  for  a  fixed  interval,  ii)  if 
R(il,hl,h2),  then  hi  and  ht  agree  on  all  events  and 
properties  that  end  before  or  at  the  same  time  as  il 
and  iii)  if  R(il,hl,h2),  then  R(i2,hl,h2)  for  all  intervals 
i2  that  end  at  the  same  time  as  or  before  il. 

In  situation  calculus,  the  execution  of  an  action  is 
given  with  respect  to  a  situation,  and  an  action  is 
modeled  as  a  function  from  situation  to  situation,  hi 
our  model,  the  execution  of  an  action  instance  is  given 
with  respect  to  a  world-history,  and  an  action  instance 
is  modeled  as  a  function  from  world-history  to  set  of 
world-histories.  The  rest  of  the  paper  is  devoted  to 
describing  this  function  and  showing  how  a  function 
associated  with  a  set  of  action  instances  can  be  con 
structed  from  the  functions  associated  with  its 
members. 

In  this  paper,  we  will  only  discuss  a  type  of  action 
instance  called  a  basic  action  instance.  Basic  actions 
[Goldman,  1970]  refer  to  actions  that  are  primitive  in 
the  sense  that  all  non-basic  actions  are  brought  about 
by  performing  one  or  more  basic  actions  under 
appropriate  conditions.  In  (Pelavin,  1987],  we  describe 
how  all  other  action  instances  (which  we  refer  to  as 
plan  instances")  are  defined  in  terms  of  basic  action 
instances. 

III.  The  Fci  function 

The  result  of  executing  a  basic  action  instance  with 
respect  to  a  world-history  is  given  by  the  Fel  function. 
FC|  takes  a  basic  action  instance  bai  and  a  world- 
history  A  as  arguments  and  yields  a  nonempty  set  of 
world-histories  that  "differ  from  A  solely  on  the 
account  of  the  occurrence  of  bat".  Equivalently,  we  say 
that  the  world-histories  belonging  to  Fel(bai,h)  are  the 
closest  world-histories"  to  A  where  basic  action 
instance  bat  occurs.  The  term  “closest”  is  a  vestige 
from  Stalnaker’s  [Stalnaker,  1968]  and  Lewis’  (Lewis, 
1973]  semantic  theories  of  conditionals  from  which 
our  treatment  derives. 

In  the  remainder  of  this  section,  we  explain  what 
we  mean  by  "differing  solely  on  the  account  of  the 
occurrence  of  a  basic  action  instance"  and  present  the 
constraints  that  are  imposed  on  Fd  in  accordance  with 
these  intuitions.  Very  briefly,  if  hi  belongs  to 
r  c|(bai,h),  then  A  and  hZ  will  coincide  on  all  conditions 
that  are  not  affected  by  the  occurrence  of  bai.  This 
includes  conditions  out  of  the  agent’s  control,  such  as 
whether  or  not  it  is  raining  during  some  interval,  and 
conditions  that  only  refer  to  times  that  end  before  bai. 
One  reason  that  Fct  yields  a  set  of  world-histories, 
rather  than  a  single  one,  is  to  provide  for  non- 
deterministic  basic  actions.  Another  reason  for  treat- 
mg  Fcl  as  a  set  is  explained  later. 
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We  use  a  term  of  the  form  ”ba@t'  to  refer  to  a 
basic  action  instance  whose  time  of  occurrence  is  i. 
The  treatment  of  FC|(ba@i,h)  is  trivial  when  ba@i 
occurs  in  k.  In  this  case,  Fcl(ba@i,h)  is  equal  to  {h} 
reflecting  the  principle  that  a  world-history  is  closer  to 
itself  than  any  other  world-history.  This  is  captured 
by  the  following  constraint  which  is  imposed  on  our 
models: 

BAI) 

For  all  basic  action  instances  (ba@i), 
and  world-histories  (h),  if  h  €  OC(ba@i), 
then  Fe,(ba@iJ>)  -  {h> 

where  OC(ba@i)  is  the  set  of  world-histories 
in  which  ba@i  occurs 

FC|(ba@i,h)  is  also  set  to  {h}  when  ba@fs  stan¬ 
dard  conditions  do  not  hold  in  h.  The  term  "standard 
conditions"  is  taken  from  Goldman  [Goldman,  1070] 
although  we  use  it  in  more  general  way.  A  basic 
action's  standard  conditions  are  conditions  that  must 
hold  in  order  to  execute  the  action.  For  example,  the 
standard  conditions  for  "the  agent  moves  its  right  arm 
up  during  time  "  include  the  condition  that  the  arm  is 
not  broken  during  time  i  We  also  use  standard  condi¬ 
tions  to  refer  to  the  conditions  under  which  a  move  is 
legal  when  modeling  a  board  game. 

If  6a@i’s  standard  conditions  do  not  hold  in  k, 
then  FC|(ba@i,h),  which  equals  {h},  contains  a  world- 
history  in  which  ba@t  does  not  occur.  In  effect,  if 
ba@ «’s  standard  conditions  do  not  occur  in  k,  we  are 
not  defining  "the  closest  world-history  to  A  where 
bai@i  occurs".  We  treat  the  lack  of  standard  condi¬ 
tions  this  way  because  we  want  to  restrict  Fc)  so  that 
if  kt  belongs  to  Fd(ba@,h)  then  kS  and  h  agree  on  all 
conditions  that  are  not  affected,  directly  or  indirectly, 
by  ba@i.  This  restriction  would  be  violated  if  bai@5’s 
standard  conditions  did  not  hold  in  A,  but  Fe|(bai@i,h) 
contained  a  world-histopr  hi  where  ba@i  occurs.  This 
stems  from  an  assumption  that  a  basic  action  cannot 
affect  whether  or  not  its  own  standard  conditions  hold. 

Fc)(ba@i,h)  yields  a  non-trivial  result  when 
6«@r s  standard  conditions  hold  in  k,  but  6a@t  does 
not  occur  in  k.  In  this  case,  all  the  members  belonging 
to  Fc|(ba@i,h)  differ  from  A  and  6a@i  occurs  in  all 
these  world-histories.  Consequently,  we  impose  the 
following  constraint: 

BA2) 

For  all  world-bistories  (h  and  h2) 
and  basic  action  instances  (ba@i), 
if  Fc|(ba@i.h)  *  {h}  then 
Fcl(ba@i,h)  Q  OC{ ba@i) 

Typically,  when  hi  belongs  to  Fd(ba@,h)  and  kS 
is  distinct  from  A  (which  we  will  assume  in  the  rest  of 
this  section),  the  two  world-histories  will  differ  on 
more  than  the  status  of  "ba@i  occurs".  We  assume 
that  the  set  of  world-histories  adhere  to  a  set  of  laws 
that  govern  the  relations  between  events,  properties, 
and  other  objects  in  the  world-histories.  A  world- 
history  formed  by  just  modifying  A  to  make  ”ba@i 
occurs"  true  may  violate  some  laws.  Consequently,  A 
and  kt  will  also  differ,  on  some  conditions  that  are 
related,  directly  or  indirectly,  to  ”ba@i  occurs"  by 
some  set  of  laws. 


As  an  example,  suppose  that  property  prS  does 
not  hold  during  interval  ig  in  A,  but  there  is  a  law 
that  entails  that  if  ba@i  occurs  then  pr2  holds  during 
ig.  Consequently,  A  and  kS  must  differ  on  the  status 
of  “pr2  holds  during  if"  since  ba@i  occurs  in  hS. 
World-histories  A  and  hS  may  also  differ  on  conditions 
that  are  indirectly  affected  by  ba@i.  Suppose  that 
there  is  a  second  law  that  entails  that  if  pr2  holds 
during  if  then  prS  holds  during  tS.  If  prS  does  not 
hold  during  iS  in  A,  then  A  and  Af  will  also  differ  on 
this  condition. 

As  a  second  example,  consider  a  law  that  entails 
that  ba@i  and  ba2@i  cannot  occur  together.  Thus,  if 
ba2@i  occurs  in  A,  any  world-history  Af  belonging  to 
Fc|(ba@,h)  will  differ  from  A  because  ba2@t  does  not 
occur  in  Af.  This  type  of  relation,  as  we  will  see, 
forms  the  basis  for  detecting  interference  between 
basic  action  instances  and  is  used  when  composing 
basic  action  instances  together. 

We  assume  that  the  difference  between  A  and  Af 
are  minimal  in  that  changes  are  only  made  in  going 
from  A  to  Af  to  satisfy  laws  that  would  be  violated  if 
these  changes  were  not  made.  They  agree  on  all  other 
conditions.  This  includes  conditions  out  of  the  agent's 
control  such  as  whether  or  not  it  is  raining  out.  We 
also  constrain  our  models  so  that  A  and  Af  agree  on  all 
conditions  that  refer  to  times  that  are  prior  to  ba@t  s 
time  of  occurrence.  This  is  captured  by  a  constraint 
relating  Fd  to  the  R  relation  which  is  given  as  follows: 

BA-Rl) 

For  all  world-histories  (hi  and  h2), 

basic  action  instances  (ba@i),  and  intervals  (iO), 

if  h2  €  Fc|(ba@i,h)  and  MEETS(iO,i),  then  RfiOA.h2) 

BA-Rl  entails  the  relation  that  two  world-histories 
differing  on  the  occurrence  of  ba@i  must  coincide  on 
all  conditions  that  end  before  the  beginning  of  interval 
t.  This  restriction  presupposes  that  there  are  no  laws 
specifying  whether  or  not  a  basic  action  instance, 
whose  standard  conditions  hold,  occurs. 

One  reason  why  Fd(ba@,b)  yields  a  set  of 
world-histories,  instead  of  a  single  one,  is  that  there 
may  be  many  ways  to  minimally  modify  A  to  account 
for  ba@ t’s  occurrence.  For  example,  suppose  that  only 
two  of  the  three  basic  action  instances,  bal@i,  bat@i, 
and  baS@t,  can  be  executed  together.  Also  assume 
that  both  baS@i  and  baS@i  occur  in  A.  In  this  case, 
FC|(bal@i,b)  will  contain  (at  least)  two  world-histories: 
one  where  both  bal@i  and  ba2@i  occur,  but  not 
baS@i,  and  another  where  bal@i  and  baS@i  occur, 
but  not  ba2@i. 

It  is  important  to  emphasize  that  the  Fcl  function 
is  part  of  the  semantic  model  and  thus  there  is  no 
need  to  precisely  specify  this  function  when  reasoning 
in  our  logic.  We  describe  the  world  using  a  set  of  sen¬ 
tences  in  our  language  (which  is  described  in  [Pelavin 
and  Allen,  1986]  and  jPelavin,  1987])  Typically,  a  set 
of  sentences  only  partially  describe  a  model;  there  may 
be  many  models  that  satisfy  a  set  of  sentences.  The 
Fc)  function  provides  a  simple  underlying  structure  to 
interpret  sentences  that  describe  what  a  basic  action 
instance  affects  and  does  not  affect  with  respect  to  a. 
context  that  may  include  conditions  that  hold  while 
the  basic,  action  instance  is  to  be  executed.  As  we  will 
see,  it  also  provides  a  simple  basis  for  modeling  basic 
action  instance  interactions  and  for  treating  the  joint- 
execution  of  a  set  of  basic  action  instances. 
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IV.  Composing  action  instances 

The  result  of  executing  a  set  of  basic  action  instances 
together  is  computed  from  the  individual  members  in 
the  set.  In  other  words,  Fci  applied  to  a  set  of  basic 
action  instances  bai-set  is  defined  in  terms  of  FC| 
applied  individually  to  each  member  in  bai-set  In  this 
section,  we  will  let  Fcl  take  a  set  of  basic  actions 
instances  as  an  argument,  rather  than  a  single  one;  Fc| 
applied  to  the  singleton  set  (bai)  is  to  be  treated  as 
we  described  FC|  applied  to  bat  in  the  last  section. 

Any  set  of  basic  action  instances  can  be  composed 
together  regardless  of  their  temporal  relation.  More¬ 
over,  the  definition  of  Fc|  applied  to  bai-set  does  not 
need  to  be  conditionalized  on  the  temporal  relations 
between  the  members  of  bai-set  So  for  example,  the 
composition  of  two  concurrent  basic  action  instances  is 
defined  in  the  same  way  as  the  composition  of  two 
basic  action  instances  that  do  not  overlap  in  time. 

The  following  notation  is  introduced  to  succinctly 
present  the  definition  of  Fc|  applied  to  a  basic  action 
instance  set  and  to  present  two  related  constraints. 

The  constructor  function  combines  two  func¬ 
tions  from  H  to  2"  to  form  a  function  from  H  to  2", 
where  H  denotes  a  set  of  world-histories: 
fx*fy(h)  (J  fy(hx) 

The  set  of  composition  functions  of  a  basic  action 
instance  set  is  recursively  defined  by: 

i)  A  singleton  basic  action  instance  set  {bai}  has  one  composi¬ 
tion  function:  Xh.Fc|({bai},h) 

ii)  The  composition  functions  of  a  basic  action  instance  set 
isi-set  with  more  than  one  element:  {bai'emp  |  bai  €  bai- 
set  and  cmp  is  a  composition  function  of  (bai-set  —  bai}} 

If  cmp  is  a  composition  function  of  bai-set,  then 
cmp(h)  yields  the  set  of  world-histories  that  would  be 
reached  by  successively  modifying  h  by  the  basic 
action  instances  belonging  to  bai-set  in  some  order. 

ALL-OC  relates  world- histories  and  composition 
functions: 

For  any  composition  function  of  bai-set  (cmp), 
ALL-OC(h,emp)  "«c|ef  cmp{h)  £  OC(bai-set) 

If  cmp  is  a  composition  function  of  bai-set,  then  ALL- 
OCfh.cmp)  is  true  iff  the  result  of  modifying  k  succes¬ 
sively  by  all  the  members  in  bai-set,  in  the  order  impli¬ 
cit  in  cmp,  yields  a  set  of  world-histories  where  ail  the 
members  in  bai-set  occur. 

Finally,  the  definition  of  Fe|  and  constraints  BA- 
CMPl  and  BA-CMP2  are  given  by: 


cmp(h)  If  there  exists  a  composition 

function  of  bai-set  (cmp) 

F«i(b*i-*et,h)  -M 

such  that  ALL-OC(h,cmp) 

(h)  Otherwise 

BA-CMPl) 

For  all  world-histories  (b)  and  basic  action  instance 
sets  (bai-set),  if  there  exists  two  composition  functions 
of  bai-set  (cmpl  and  cmp2)  such  that  ALL-OC(h.cmpl) 
and  ALL-OC(h,cmp2),  then  cmpl(h)  «■  cmp2(h) 

BA-CMP2) 

For  all  world-histories  (h)  and  composition 
functions  (cmpl  and  cmp2),if  ALL-OC(h,cmp2)  and 
ALL-OC(h,cmpl*cmp2)  then  ALL-OC(h,cmp2*cmpl) 

In  the  following  discussion,  we  examine  BA- 
CMPI,  BA-CMP2,  and  the  definition  of  Fc|(bai-set)  for 
the  case  where  bai-set  consists  of  two  basic  action 
instances,  bail  and  bait,  that  yield  unique  closest 
world- histories  when  Fc|  is  applied  to  either  of  them  at 
any  world-history.  In  [Pelavin,  1987],  a  detailed  expla¬ 
nation  is  provided  for  the  other  cases,  such  as  when 
bai-set  contains  three  or  more  members. 

The  set  {bail,bai2}  has  two  composition  func¬ 
tions,  which  we  will  denote  by  bail*bai2  and 
bai2*bail.  bail*bai2(h)  yields  a  singleton  set  contain¬ 
ing  the  world-history  obtained  by  modifying  h,  first  by 
bail,  then  by  bait.  bai2*bail(h)  yields  a  singleton  set 
containing  the  world-history  obtained  by  modifying  k , 
first  by  bait,  then  by  bail.  It  is  important  to  keep  in 
mind  that  bail  and  bait  have  fixed  times  associated 
with  them  and  consequently  may  have  any  temporal 
relation.  Thus,  bail*bai2(h)  does  not  necessarily 
describe  the  results  of  executing  bail  before  bait,  since 
bait  may  be  prior  to  or  concurrent  with  bail. 

Let  us  first  consider  the  case  where  bail's  and 
bait's  standard  conditions  hold  at  all  world-histories. 
We  say  that  bail  and  bait  interfere  at  world-history  k 
if  they  cannot  be  executed  together  in  the  context 
given  by  h.  If  they  interfere,  we  set  Fc)({bail,bai2},h) 
to  {h},  treating  {bail,bai2}  as  if  its  standard  condi¬ 
tions  do  not  hoM  at  A.  As  an  example,  "move  right 
hand  up  during  i"  and  "move  right  hand  down  during 
i"  are  basic  action  instances  that  interfere  at  all 
world-histories  (when  modeling  a  typical  world).  Con¬ 
versely,  "move  right  hand  up  during  i"  and  "move  left 
hand  down  during  i"  do  not  interfere  at  any  world- 
history. 

We  may  also  model  basic  action  instances  that 
conditionally  interfere,  ones  that  interfere  at  some 
world-histories  but  not  at  others.  For  example,  if  two 
concurrent  basic  action  instances  share  the  same  type 
of  resource,  they  interfere  only  at  world-histories 
where  there  is  not  enough  of  this  resource  available 
during  their  time  of  execution.  It  is  important  to  note 
that  interference  is  defined  relative  to  world-histories. 
Consequently,  whether  two  or  more  basic  actions 
interfere  can  depend  on  conditions  that  hold  during 
execution.  Some  other  treatments  of  interference  in 
the  AI  literature,  such  as  Georgeff  [Georgeff,  1986], 
provide  for  conditional  interference,  Dpt  only  in  the 
case  when  interference  depends  on  conditions  that 
hold  just  prior  to  execution. 

We  can  detect  whether  bail  and  bait  interfere  at 
a  world-history  A  by  examining  Fc|  applied  to  bail  and 
bait  individually.  Since  we  are  assuming  that  bail's 
(and  bait's)  standard  conditions  bold  everywhere, 
Fj|({bail},h)  yields  a  world-history  in  which  bail 
occurs.  Call  this  world-history  hx.  If  bail  and  bait 
interfere  at  A,  and  consequently  at  hx,  Fct(fbai2},hx) 
yields  a  world-history  where  bait  occurs  (since  its 
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standard  conditions  hold  at  kx),  but  not  bail.  If  they 
do  not  interfere,  both  bail  and  baiS  occur  in 
FC|({bai2},hx)  in  which  case  we  set  Fc|({bail,bai2},h) 
to  Fe)({bai2},hx).J  Since  Fc|({bai2},hx)  is  the  result  of 
modifying  A  first  by  bail,  then  by  baiS,  it  is  equivalent 
to  bail*bai2(h). 

We  can  also  detect  if  bail  and  baiS  interfere  by 
modifying  A  first  by  baiS,  then  by  bail.  bai2*bail(h) 
yields  this  world- history.  If  bail  and  baiS  interfere  at 
A,  then  bail,  but  not  baiS,  occurs  in  bai2*bail(h).  If 
they  do  not  interfere,  both  bail  and  baiS  occur  in 
bai2*bail(h).  Moreover,  if  they  do  not  interfere,  we 
assume  that  modifying  A,  first  by  bail,  then  by  baiS 
yields  the  same  world-history  obtained  by  modifying  A, 
first  by  baiS,  then  by  bail. 

The  definition  of  Fc)  and  constraints  BA-CMP1 
and  BA-CMP2  capture  the  treatment  described  above. 
If  bail  and  baiS  interfere  with  each  other  at  A,  then 
bail  and  baiS  do  not  both  occur  together  in  either 
bail*bai2(h)  or  bai2*bail(h).  Consequently, 
FC|({bail,bai2},h)  is  defined  as  {h}.  If  bail  and  baiS 
do  not  interfere,  then  they  occur  together  in  both 
bail*bai2(h)  and  bai2*bail(hl.  In  this  case 
Fc|({bail,bai2},h)  is  set  to  bail*oai2(h)  which  equals 
bai2*bail(h)  by  constraint  BA-CMPI.  For  the  case 
where  bail’s  and  baiS’s  standard  conditions  hold 
everywhere,  constraint  BA-CMP2  insures  that 
bail*bai2(h)  and  bail*bai2(h)  are  compatible;  they 
would  be  incompatible,  if  both  bail  and  baiS  occurred 
together  in  one  of  them,  signifying  that  bail  and  baiS 
did  not  interfere  at  A,  but  did  not  occur  together  in 
the  other,  signifying  they  did  interfere  at  A. 

The  analysis  described  above  also  applies  in  less 
restrictive  cases  where  bail's  and  baiS’s  standard  con¬ 
ditions  may  not  hold  at  all  world-histories.  This 
analysis  is  applicable  as  long  as  bail's  standard  condi¬ 
tions  hold  at  both  A  and  Fc|({bai2},h),  and  baiS’s  stan¬ 
dard  conditions  hold  at  both  A  and  Fci({bail},h). 

Let  us  now  consider  the  case  where  both  bail's 
and  baiSs  standard  conditions  hold  at  A,  but  the 
occurrence  of  one  of  the  basic  action  instances,,  say 
bail,  ruins  the  others  standard  conditions.  This  situa¬ 
tion  is  treated  as  interference;  Fc|({bail,bai2},h)  is  set 
to  {h}.  If  bail  ruins  baiS's  standard  conditions  with 
respect  to  A  then  baiS's  standard  conditions  do  not 
hold  in  Fe|({bail},h).  Consequently,  bail,  but  not  baiS, 
occurs  in  bail*bai2(h).  By  constraint  BA-CMP2,  both 
bail  and  baiS  will  not  occur  together  in  bai2*bail(h) 
either.  Thus,  by  the  definition  of  Fc(,  we  see  that 
Fci({bail,bai2},h)  is  set  to  {h}. 

The  next  case  to  consider  is  where  bail's,  but  not 
baiS's,  standard  conditions  hold  at  A.  In  this  situa¬ 
tion,  Fcl({bail,bai2},h)  is  set  to  {h}  unless  the  follow¬ 
ing  two  conditions  hold:  i)  the  occurrence  of  bail  with 
respect  to  A  brings  about  baiS's  standard  conditions, 
and  ii)  they  do  not  interfere  with  each  other  at  A.  If 
both  i)  and  ii)  hold,  then  both  bail  and  baiS  occur 
together  in  bail*bai2(h).  Consequently,  by  the 
definition  of  Fcl,  Fc)({bail,bai2},h)  is  set  to 
bail*bai2(h).  This  case  differs  from  the  situation 
where  both  bail's  and  baiS's  standard  conditions  hold 


2  World-history  Az  sad  the  world-history  in  Fc|({bail},hx)  are 
aot  necessarily  distinct  from  A.  For  example,  if  both  bail  sod 
bait  occur  in. A,  then  Fc|({bsil},h)  —  Fc|((bsi2)h)  •  {h}  by 
constraint  BA1;  consequently  Fc|({bsil,bsi2},h)  equsls  {h}. 


in  A;  if  bail  brings  about  baiS's  standard  conditions, 
then  bail  and  baiS  do  not  necessarily  occur  together  in 
bai2*bail(h)  even  though  they  do  not  interfere. 
Appropriately,  constraints  BA-CMPl  and  BA-CMP2 
are  n6t  applicable  in  this  case. 

The  last  case  to  consider  is  where  both  bail's  and 
baiS's  standard  conditions  do  not  hold  in  A.  In  this 
situation,  Fc|({bail,bai2},h)  is  simply  set  to  {h}. 

V.  Conclusion 

We  have  presented  a  model  that  provides  for  con¬ 
current  actions  having  temporal  extent.  We  have 
integrated  Allen’s  model  (Allen,  1984),  which  can  treat 
simultaneous  events  having  temporal  extent,  with  a 
structure  analogous  to  the  result  function  in  situation 
calculus.  This  structure  captures  the  result  of  execut¬ 
ing  an  action  at  a  specified  time  with  respect  to  a  con¬ 
text  given  by  a  world-history,  i.e.  a  complete  world 
over  time.  This  enables  us  to  model  actions  and 
action  interactions  that  are  affected  by  conditions  that 
hold  during  execution.  This  structure  also  provides  a 
simple  framework  for  composing  simple  actions,  both 
concurrent  and  sequential,  to  form  complex  ones. 
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Abstract 


An  abstraction  mapping  over  clausal  form  theories 
in  Brat-order  predicate  calculus  is  presented  that 
involves  the  renaming  of  predicate  symbols.  This 
renaming  is  not  1-1,  in  the  sense  that  several 
predicate  symbols  Ri R„  from  the  original 
theory  are  all  replaced  by  a  single  symbol  R’  in  the 
abstract  theory.  In  order  to  preserve  consistency, 
however,  the  clauses  that  distinguish  the  Rj’s  must 
be  discarded  in  the  abstract  theory.  This  leads  to  a 
simple  semantics;  the  union  of  the  extensions  of 
each  of  the  Ri’s  in  any  model  of  the  original  theory 
forms  the  extension  of  R’  in  a  model  of  the  abstract 
theory. 

1  Introduction 

An  important  method  of  constraining  search  in 
combinatorially  intractable  problems  is  to  map  the 
representation  of  the  problem  into  an  abstract 
representation,  solve  the  problem  in  the  abstract 
search  space  and  use  the  abstract  solution  to  guide 
the  search  in  the  original  problem  space.  Work  of 
this  nature  has  been  pursued  by  many  researchers, 
most  notably  [Amarel  1972,  Hobbs  1985,  Korf  1985, 
Plaisted  1981,  Sacerdoti  1974].  Of  importance  in 
this  work  is  that  certain  formal  properties  hold 
between  the  original  and  the  abstract  versions  of 
the  problem  in  order  to  justify  the  use  of 
abstraction.  In  this  paper  we  will  deBne  a  syntactic 
abstraction  mapping  between  two  Brat-order 
theories  that  involves  the  renaming  of  predicate 
symbols,  a  subclass  of  those  mappings  Brat  deBned 
in  [Plaisted  1981].  The  mappinVr  is  demonstrated  to 
have  the  following  properties.  First,  the  abstract 
thoery  will  be  consistent  if  the  original  theory  is; 
second,  for  each  abstract  theorem  T\  there  exists  a 
theorem  T  in  the  original  theory  for  which  T  is  an 
abstraction;  third,  the  model-theoretic  semantics  of 
the  abstract  theory  is  intuitively  appealing. 

In  section  2  we  present  the  formal  deBnition  of  a 
prtdieaU  mapping ,  and  show  in  section  3  both  its 
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relation  to  Plaisted’s  abstraction  mappings,  and  its 
inherent  problem  of  generating  inconsistencies  at  the 
abstract  level.  Section  4  provides  a  set  of  syntactic 
restrictions  that  satisfies  a  particular  semantics,  and 
Section  5  demonstrates  that  these  restrictions 
overcome  the  inconsistency  problem  by  showing 
constructively  that  a  model  exists  for  the  abstract 
theory  given  a  model  of  the  original  theory.  Section  6 
will  consist  of  a  small  example,  and  section  7  finishes 
with  concluding  remarks. 

2  Predicate  Mappings 

We  will  assume  standard  Brat-order  logic  terminology 
as  in  [Enderton  1972].  In  addition,  all  wffs  will  be 
taken  to  be  in  clause  form  [Robinson  1965].  Recall  that 
a  clause  is  a  set  of  literals,  and  that  clause  C  subsumes 
clause  D  if  there  exists  a  substitution  $  such  that  C$  is 
a  subset  of  D.  In  addition,  C  and  D  are  variants  if  C 
and  D  are  instances  of  one  another,  that  is,  they  are 
identical  except  for  a  renaming  of  variables. 

Predicate  mappings  are  functions  that  map  predicate 
symbols  from  one  first  order  language  to  those  of 
another.  Given  two  sets  of  predicates  Pi,  P2, 

f:Pl-*P2. 

where  the  Pj  are  the  only  symbols  that  possibly 
distinguish  the  Brat  order  languages  Li  and  L2.  What 
is  noteworthy  about  f  is  that  it  is  not  necessarily  1-1 , 
and  therefore  more  than  one  relation  symbol  from  Li 
can  map  to  the  same  relation  symbol  from  L2.  We  can 
then  define  f  over  literals  such  that  literals  in  Li  are 
mapped  to  literals  in  L2  by  replacing  the  predicate 
symbols  under  f.  Likewise,  f  can  be  extended  to  clauses 
and  sets  of  clauses  in  precisely  this  way. 

3  Plaisted’s  Abstraction  Mappings 

Predicate  mappings  are  a  subclass  of  abstraction 
mappings  that  are  deBned  in  [Plaisted  1981]: 

DEFINITION.  An  abstraction  is  an  association  of  a  set 
RC)  of  clauses  with  each  clause  C  such  that  f  has  the 
following  properties: 

(1)  If  clause  C3  is  a  resolvent  of  Cl  and  C2  and 
D3  €  BC3),  then  there  exist  D1  (  BC1)  and 

D2  (  BC2)  such  that  some  resolvent  of  D1  and  D2 
subsumes  D3. 

(2)  fiNIL)  s  {NIL}.  (NIL  is  the  empty  clause.) 

(3)  If  Cl  subsumes  C2,  then  for  every  abstraction  D2 
of  C2  there  is  an  abstraction  D1  of  Cl  such  that  D1 
subsumes  D2. 

If  f  is  a  mapping  with  these  properties,  then  we  call  f  an 
abstraction  mapping  of  clauses. 
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Ia  addition,  Plaisted  proves  the  following  theorem: 

THEOREM.  Suppose  <p  is  a  mapping  from  literals  to 
literals.  Let  us  extend  $  to  a  mapping  from  clauses  to 
clauses  by  $(C)  =  ($(L) :  L  t  C}.  Suppose  $  satisfies  the 
following  two  properties: 

(1)  4K  ->L)  =  ->($(L)).  That  is,  preserves 
complements. 

(2)  If  C  and  D  are  clauses  and  O  ia  an  instance  of  C, 
then  <p(D)  is  an  intance  of  $!C).  That  is,  $ 
preserves  instances. 

Then  $  is  an  abstraction  mapping. 

By  this  theorem,  predicate  mappings  are  abstraction 
mappings  since  they  preserve  both  complement  and 
instance. 


Abstraction  mappings  have  the  property  that  from 
every  clause  C  derivable  from  a  set  of  clauses  S,  there 
exists  a  clause  C*  derivable  from  RS)  that  subsumes  an 
abstraction  of  C  (proven  in  [Plaisted  1981]). 
Informally,  we  might  say  that  every  solution  in  the 
original  problem  space  has  a  corresponding  solution  in 
the  abstract  space.  This  will  be  termed  the  upward- 
solution  property. 


A  problem  with  abstraction  mappings,  however,  (and 
hence  with  predicate  mappings)  is  that  they  may  result 
in  abstract  theories  which  are  inconsistent  even  though 
the  original  theory  is  consistent,  which  Plaisted  termed 
the  "false  proof"  problem.  As  an  example,  suppose  we 
have  a  clause  set  for  a  simple  domain  in  whicn  we  have 
objects  that  are  bottles  ana  glasses,  with  clauses 
stating  that  they  are  disjoint  (note  that  all  clauses  will 
be  in  implication  form  for  ease  of  interpretation): 

1)  Bottle!  x)  3  ~<Glass(x), 

2)  Glass(x)  3  -’Bottle(x), 

and  the  clause  statipg  that  object  A  is  a  bottle: 

3)  Bottle(A). 

Hour  predicate  mapping  f  maps  both  Bottle  and  Glass 
to  a  new  predicate  Container  in  the  abstract  theory, 
then  from  the  abstract  theory  we  can  derive  a 
contradiction: 


Container^)  3  ->Containerix) 
ContaineriA) 


"•Container!  A) 


The  approach  taken  in  the  remainder  of  the  paper  is  to 
restrict  the  original  set  of  clauses  over  which  the 
predicate  mapping  can  be  applied  so  as  to  preserve 
consistency. 


4  Restricted  Predicate  Mappings 

The  intuition  behind  the  semantics  of  restricted 
predicate  mappings  is  that  we  would  like  the 
interpretation  of  a  predicate  in  the  abstract  theory  to 
be  the  union  of  the  interpretations  of  each  of  the 
predicates  in  the  original  theory  that  map  to  it  So  the 
objects  that  are  Containers  are  all  of  those  objects  that 
are  either  Bottles  or  Glasses,  or  any  of  the  other  things 
that  map  to  Container. 


We  can  obtain  this  interpretation  syntactically  by 
removing  all  of  those  axioms  from  the  original  theory 
that  serve  to  distinguish  the  relations  that  are 

conflated  at  the  abstract  level.  Relations  Ri . Rn  in 

axiomatization  0|,  can  be  made  indistinguishable  in 
axiomatization  62  by  systematically  replacing  each  R; 


with  a  new  symbol  R’  under  some  predicate  mapping  f 
as  before,  but  including  in  62  only  those  clauses  ftC) 
such  that  C  is  in  61  ana  every  clause  mapping  to  C  is 
derivable  from  81.  So,  for  instance,  if  we  wish  to 
conflate  the  relations  Glass  and  Bottle  by  mapping 
them  to  the  same  symbol,  say  Container,  then  given 
the  axioms 

Glass(x)  3  Breakable(x) 

Bottle(x)  3  Breakable(x) 

Bottle(x)  3  Corkable(x) 
we  would  have  in  the  new  clause  set  only 
Containerix)  3  Breakable(x), 
since  Corkability  is  true  only  of  Bottles,  and  therefore 
distinguishes  Bottles  from  Glasses  in  the  original 
theory,  while  Breakabiiitv  is  true  of  both  Bottles  and 
Glasses.  In  other  words,  the  clause 
Containerix)  3  Corkable(x) 
would  not  be  placed  in  the  new  clause  set  since  one  of 
the  clauses  that  maps  to  it,  namely 
Giass(x)  3  Corkable(x) 
is  not  derivable  from  the  original  clause  set. 

This  constraint,  however,  is  stronger  than  required. 
For  instance,  it  keeps  one  from  ever  asserting  in  the 
abstract  theory  that  an  object  is  a  container  in  theories 
in  which  bottles  and  glasses  are  disjoint,  as  in  (1)  -(3) 
above,  since  it  will  never  be  the  case  that  an  object  in 
the  original  theory  is  both  a  bottle  and  a  glass.  Given 
the  desired  semantics,  it  is  permissible  to  allow  the 
inclusion  of  mapped  positive  clauses  from  the  original 
theory,  those  clauses  containing  no  negated  literals, 
into  tne  abstract  theoiy.  That  is,  if  object  A  has 
property  P,  then  certainly  object  A  will  have  the 
abstract  property  flP),  and  likewise  for  predicates  of 
any  arity.  Thus,  one  can  map  the  clause  Bottle! A)  to 
ContaineriA),  since  all  bottles  are  containers,  even 
though  Glass! A)  may  not  hold.  In  addition,  this  allows 
the  mapping  of  clauses  such  as  Bottle(  A)  V  Glass!  A)  to 
ContaineriA)  V  ContaineriA). 

5  Formal  Definitions  and  Proof  of  Consistency 

We  use  [C]f  to  denote  the  set  of  symbols  or  clauses  that 
map  into  C  under  predicate  mapping  f.  That  is, 
fC]r={D|flD)  =  C}. 

Each  such  D  will  be  called  a  specialization  ofC.  If  6  is 
a  clause  set  and  D  is  a  clause  then  we  will  use  6  k  D  to 
mean  that  the  null  clause  can  be  derived  from  the 
clause  set  0  u  _,D  by  the  inference  rule  full  resolution 
[Robinson  1965].  Let  0  be  an  axiomatization  in  clause 
form  of  a  theory  over  language  Li  and  let  f  be  a 
predicate  mapping  from  Li  to  L2.  We  define  g!0),  a 
restricted  predicate  mapping,  to  be  a  subset  of  R8), 
where 

(4)g(0)  =  {C  |  there  exists  some  Dc[CJrsuch  that  DC  6 
and  either  C  is  a  positive  clause  or 
for  every  Dc[C]f  it  is  the  case  that  8t-D}. 
The  main  theorem  states  that  given  a  clause  form 
axiomatization  6  and  a  restricted  predicate  mapping  g, 
g(9)  is  consistent  if  0  is.  In  other  words,  consistency 1 
preserved  by  the  mapping.  Given  that  a  first-order 
clause  set  9  is  consistent  if  and  only  if  it  is  satisflable, 
i.e.,  there  exists  a  model  that  assigns  truth  to  all  of  the 
clauses  D  such  that  Bi-D,  this  theorem  will  be  proven 
by  constructing  a  model  of  g(8)  from  a  model  of  0. 

Standard  formal  semantic  terminology  will  be  assumed 
[Enderton  1972].  In  particular,  an  interpretation  is  a 
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atructure  <D,  G>,  where  D  is  a  set  of  objects,  the 
domain,  and  G  is  an  interpretation  function  that 
assigns  to  each  constant  symbol  an  object  from  D,  to 
each  n-function  symbol  an  n-function  mapping  objects 
in  D  to  an  object  in  0,  and  to  each  n-predicate  symbol 
an  n-relation  over  objects  in  D.  Further,  we  say  that  y 
is  a  oalue  assignment  if  it  assigns  to  every  variable 
symbol  an  object  in  D.  If  a  is  a  symbol  from  the 
language  over  which  G  is  defined,  we  say  that  G(o)  is 
the  interpretation  of  o  under  M,  which  in  the  case  of  an 
n-predicate  will  be  a  set  of  n-tuples.  In  addition, 
clauses  are  assigned  truth  values  according  to  semantic 
rules  pertaining  to  the  logical  symbols,  which  in  the 
case  of  clause  form  is  universal  quantification, 
negation,  and  disjunction.  If  a  is  a  clause  and  y  is  a 
value  assignment,  then  M(o)  returns  either  True  or 
False  under  y,  denoted  by  My(o).  IfMy(o)  =True  for 
every  value  assignment  y ,  then  we  say  that  o  is  True  in 
M,  or  M(o)=True,  otherwise  o  is  False  in  M.  An 
interpretation  M  =  <  D,  G  >  is  a  model  of  a  clause  set  0 
if  D  is  True  in  M  for  every  clause  0  such  that  0  h-  D. 

Theorem : 

Let  0  be  an  axiomatization  in  clause  form  of  a  theory 
over  language  Li,  let  f  be  a  predicate  mapping  from  Lj 
to  Li,  and  let  g  be  the  restricted  predicate  mapping 
defined  relative  to  f  as  in  (4)  above.  For  any  model 
M=<D,  G>  of0,M’=<D,G’>  is  a  model  of  g(0) 
where  for  every  predicate  symbol  o  of  L2, 

G'fa)  =  (J  Gffl. 

and  for  every  non-predicate  symbol  o,  G’(o) =G(o). 
Proof. 

Let  us  assume  that  M*  is  not  a  model  of  g<0).  There 
must  then  be  some  clause  C  €  gf  0)  that  is  False  in  M*. 
There  are  two  cases  to  consider  -  first  where  C  is  a 
positive  clause,  and  second  where  C  contains  negated 
literals. 

Case  1 :  Assume  C  is  a  positive  clause. 

For  membership  of  C  in  g(0),  it  must  be  that  there 
exists  some  clause  E  €  [C]f  such  that  E  «  0,  by  the 
definition  of  g.  Since  M  is  a  model  of  0,  E  is  True  in  M. 
Then,  for  every  value  assignement  y,  My(E) =True. 
Let  us  take  truth  in  M  and  NT  relative  to  a  particular 
value  assignment  y.  E  is  of  the  form 
E = (Pi(a  1 ), ....  P„(a„)},  where  ai  represents  the 
predicate  Pi’s  list  of  arguments.  My(E)=True  means 
that  My(Pi(ai))  =  True  for  some  literal  of  E.  Since  ai 
represents  the  terms  (an, ....  ay)  of  Pi,  then 
<G(aii),...,  Gfaii)>  f  G(Pj).  Additionally  it  follows 
that  cG’(aji) ,...,  G’(ay)>  €  G’(flPi))  under  v,  by  the 
definitions  of  the  interpretation  function  Gr  given  in 
(5)  above,  and  the  predicate  ipapping  f.  That  is,  for 
Q=flPj),  M’vfQfaii, ....  ajj))=True.  But  this  literal  is 
in  C,  and  so  M*y(C)  =True.  Therefore,  hTyfC) =True 
for  every  value  assignment  y  since  y  was  chosen 
arbitrarily,  making  M*(C) =True,  contradicting  our 
assumption.  This  dispatches  the  first  case. 

Case  2:  Assume  C  contains  negated  literals. 

For  membership  of  C  in  g(0),  for  every  clause  E  €  (C]f, 

0 1-  E  and  some  E  i  0,  by  the  definition  of  g.  Since  M  is 
a  model  of  0,  M(E) =True  for  every  such  E,  and 
therefore  My(E) =True  for  every  value  assignment  y. 

Since  C  is  False  in  M\  there  must  exist  a  value 
assignment  y  such  that  M’yfOs  False.  Let  us  take 
truth  in  M  and  M1  relative  to  this  value  assignment  y. 


C  is  of  the  form  C={Ri(ai), ...,  Rn(an),  ~»Si(bi), 
Sk(bk)},  where  at  and  b{  are  the  arguments  of 
each  of  the  literals.  That  is,  some  of  the  literals  in  C 
are  negated,  and  some  may  be  positive.  For  every 
literal  L  (  C,  \T y(L)  =  False.  This  means  that  for 
every  E  (  [C]f,  for  every  positive  literal  W(E, 

My(W)  =  False,  since  were  it  not,  then 
M’y(gfW))=True  for  some  literal  W  and  hence 
hTyiC) =True  by  the  definition  of  G\  contradicting  our 
assumption  that  NTyfC)  =  False.  Therefore,  under  this 
value  assignment  one  or  more  of  the  negated  literals 
in  each  E  (  [C]f  must  be  True. 

Since  each  — »Si(bi)  is  False  in  MP  under  y,  then  each 
Si(bi)  is  True.  That  is,  taking  bt  to  represent  the 
ground  terms  (bii>  •••.  b;])  of  each  ~»Sj, 

<  G’(bii) ,...,  G’(bij)>  (  G’(Sj)  for  each  of  the  negated 
literals  under  y.  But  by  the  definition  of  G’,  for  each  of 
the  Si’s  there  exists  some  Qi  t  [Silf  where 
<G(bji),...,G(by)>  (  G(Qi)  under  this  same  value 
assignment.  That  is,  My(  Qj(b{)) = False  for  some 
specialization  Qi  of  each  predicate  Si  of  each  negated 
literal  in  C.  Let  E  be  the  clause  consisting  of  the 
negation  of  each  such  Qi(b|)  and  some  specialization  of 
each  of  the  positive  literals  in  C.  Note  that  E  i  [C]f. 
My(E)  =  False,  since  each  of  the  literals  in  E  are  False. 
This  then  contradicts  the  earlier  conclusion  that  every 
E  f  [C]f  be  assigned  True  by  M  for  every  value 
assignment  v.  This  dispatches  case  2,  and  hence 
establishes  the  theorem,  qed 

Restricted  predicate  mappings  are  no  longer 
abstraction  mappings  by  Plaisted’a  definition,  since  the 
upward-solution  property  is  not  preserved.  However, 
abandoning  the  completeness  afforded  by  the  upward- 
solution  property  is  likely  what  is  required  to  solve 
frequently  encountered  problems  at  a  lower  cost  than  if 
they  were  solved  directly  in  the  original  theory.  These 
restricted  mappings  do  have  what  can  be  termed  the 
downward-solution  property,  since  a  trivial  corollary  of 
the  theorem  is  that  for  every  clause  derivable  Bom  the 
abstract  theory,  there  will  be  a  specialization  of  it 
derivable  Bom  the  original  theory.  Note  that  there 
may  be  no  solution  to  the  problem  in  the  abstract 
theory  since  its  solution  requires  some  of  the  details 
that  are  ignored  at  that  level.  However,  every  abstract 
solution  is  guaranteed  to  have  a  specialized  solution  to 
the  original  problem. 

An  important  objection  that  one  might  have  is  that  the 
syntactic  mapping  function  g  given  in  (4)  is 
undecidable,  since  it  is  based  upon  determining  0  1-  D 
for  every  clause  D  mapping  to  each  candidate  clause  in 
the  abstract  clause  set  One  should  note,  however,  that 
the  search  for  derivability  can  always  be  arbitrarily 
bounded,  and  if  no  proof  is  obtained  within  this  bound 
then  it  can  be  assumed  that  this  clause  is  not  derivable. 
In  this  way,  consistency  is  still  preserved  between  the 
original  and  the  abstract  theory,  the  abstract  theory 
being  simply  weaker  than  it  theoretically  could  be,  i.e., 
having  fewer  theorems.  The  appropriate  amount  of 
resource  that  one  should  expend  in  constructing 
abstract  theories  is  quite  an  important  issue  but 
outside  the  bounds  of  this  paper.  These  kinds  of 

Brformance  choices,  however,  are  discussed  in 
artman  and  Tenenberg  1987],  elsewhere  in  this 
volume. 
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6  Example 

Let  0,  the  original  clause  set,  be  the  following  clauses 
(note  that  A  D  B  is  equivalent  to  -•  A  v  B,  and  is 
therefore  not  a  positive  clause): 

I)  Bottlefx)  3  MadeOfGlais(x)  2)  Bottlelx)  3  Graspablefx) 

3)  Glaxsfx)  d  MadeOfGlassfx)  4)  Glass(x)  3  Graspablefx) 

5)  Glaasfx)  3  Open(x)  61  Boxfx)  3  Graspablefx) 

7)  Bottlefx)  3  ->Gla**(x)  S)  Glass!  x)  3  -  Bottle!  x) 

9)  Bottle! x)  3  -•Boxfx)  10)  Glassfx)  3  -Boxfx) 

II)  Box(x)  3  -Glasafx)  12)  Box(x)  3  -Bottlefx) 

13)  Bottle<x)  3  MilkBottlefx)  v  Winehottlef  x) 

14)  Open(x)  a  Graapable<x)  3  Pourablef  x) 

15)  Graspablefx)  3  Movable{x) 

16)  MadeOfGlassfx)  3  Breakablefx) 

17)  Bottlefx)  v  G!aaa(x)  v  Box(x) 

18)  Opan(A) 

Let  fi  map  each  predicate  symbol  to  itself  except  that  it 
maps  Bottle  and  Glass  both  to  GlassContainer. 
Therefore,  gi(6)  is  the  following: 

O  GlasaContainerfx)  3  MadeOfGlassfx) 

20  GlassContainerfx)  3  Graspablefx) 

60  Boxfx)  3  Graspablefx) 

90  GlassContainerfx)  3  -Boxfx) 

110  Box(x)  3  -GlassContainerfx) 

140  Open(x)  A  Graspablefx)  3  Pourable<x) 

150  Graapable(x)  3  Movablefx) 

160  MadeOfGlassfx)  3  Breakablefx) 

170  GlassContainerfx)  v  Boxfx) 

180  Open(A) 

Note  that  clauses  3  and  4  are  redundant  in  the 
abstract  theory  and  are  therefore  not  included  in  gi(0), 
as  are  10  and  12.  Clause  5  gets  eliminated  because 
Bottle(x)  D  Openfx)  is  not  derivable  from  6.  Likewise 
with  7  and  8  as  seen  earler.  Additionally,  17*  was 
factored  so  that  the  same  literal  did  not  appear  twice. 
Although  gi(6)  is  much  smaller  than  0,  there  are  still 
several  intuitive  inferences  that  can  be  made,  such  as 
Graspablefx),  Movablefx)  and  Pourablef  A). 

Given  the  way  in  which  abstraction  is  defined,  gt(0) 
can  additionally  be  treated  as  a  primitive  level  theory 
and  mapped  into  a  more  abstract  theory  g2<gi(0)).  For 
instance,  let  h  map  all  predicates  to  themselves  except 
Box  and  GlassContainer  get  mapped  to  Container. 
Thus,  g2(gi(0))  is: 

2")  Containerfx)  3  Graspablefx) 

14*0  Openfx)  A  Graspablefx)  3  Pourablef  x) 

15*0  Graspablefx)  3  Movablefx) 

16*0  MadeOfGlassfx)  3  Breakablefx) 

17*0  Containerfx) 

18")  OpenfA) 

One  can  therefore  abstract  a  theory  arbitrarily  many 
times,  preserving  consistency  throughout.  Each 
abstract  theory  has  the  advantage  that  those  clauses 
that  it  can  derive  will  likely  be  done  so  at  considerably 
less  cost  than  in  the  original  theory.  Although  this 
would  be  true  of  any  subset  of  0,  it  is  important  to 
recognize  that  the  mapping  functions  embed  a 
significant  amount  of  neunstic  information  about  the 
domain.  This  function  determines  which  clauses  are 
important,  and  which  are  mere  details,  and  so  the 


choice  about  which  clauses  remain  in  the  abstract 
theory  is  not  done  so  on  an  arbitrary  basis. 

7  Conclusion 

A  useful  way  of  viewing  abstraction  is  as  a  function 
that  maps  one  first-order  theory  into  another.  We  have 
defined  predicate  mappings  that  involve  the  renaming 
of  predicate  symbols,  where  more  than  one  of  the 
original  symbols  can  map  into  the  same  abstract 
symbol.  These  predicate  mappings  are  a  subclass  of 
Plaisted’s  abstraction  mappings,  and  as  such  suffer 
from  the  fact  that  consistency  in  the  original  theory 
may  be  lost  in  the  abstract  theory.  To  remedy  this,  we 
introduced  syntactic  restrictions  to  the  predicate 
mapping  that  guarantee  the  preservation  of 
consistency.  These  restrictions  amount  to  eliminating 
the  axioms  in  the  original  theory  that  distinguish  the 
predicates  which  are  being  conflated  in  the  abstract 
theory.  This  has  a  simple  semantics;  the  extension  of 
each  of  the  predicates  P  in  a  model  of  the  abstract 
theory  can  be  constructed  from  the  union  of  the 
extensions  of  each  of  the  predicates  that  map  to  P  in 
any  model  of  the  original  theory.  A  proof  of  this  was 
provided,  along  with  an  example  demonstrating  one 
possible  use  of  the  restricted  predicate  mappings. 
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